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ABSTRACT 

A  myriad  of  social  media  services  are  emerging  in  recent  years  that  allow  people  to  communicate  and  express 
themselves  conveniently  and  easily.  The  pervasive  use  of  social  media  generates  massive  data  at  an  unprecedented 
rate.  It  becomes  increasingly  difficult  for  online  users  to  find  relevant  infonnation  or,  in  other  words,  exacerbates  the 
information  overload  problem.  Meanwhile,  users  in  social  media  can  be  both  passive  content  consumers  and  active 
content  producers,  causing  the  quality  of  user-generated  content  can  vary  dramatically  from  excellence  to  abuse  or 
spam,  which  results  in  a  problem  of  infonnation  credibility.  Trust,  providing  evidence  about  with  whom  users  can 
trust  to  share  information  and  from  whom  users  can  accept  infonnation  without  additional  verification,  plays  a  crucial 
role  in  helping  online  users  collect  relevant  and  reliable  information.  It  has  been  proven  to  be  an  effective  way  to 
mitigate  information  overload  and  credibility  problems  and  has  attracted  increasing  attention. 

As  the  conceptual  counterpart  of  trust,  distrust  could  be  as  important  as  trust  and  its  value  has  been  widely  recognized 
by  social  sciences  in  the  physical  world.  However,  little  attention  is  paid  on  distrust  in  social  media.  Social  media 
differs  from  the  physical  world  -  (1)  its  data  is  passively  observed,  large-scale,  incomplete,  noisy  and  embedded  with 
rich  heterogeneous  sources;  and  (2)  distrust  is  generally  unavailable  in  social  media.  These  unique  properties  of 
social  media  present  novel  challenges  for  computing  distrust  in  social  media:  (1)  passively  observed  social  media 
data  does  not  provide  necessary  infonnation  social  scientists  use  to  understand  distrust,  how  can  I  understand  distrust 
in  social  media?  (2)  distrust  is  usually  invisible  in  social  media,  how  can  I  make  invisible  distrust  visible  by 
leveraging  unique  properties  of  social  media  data?  and  (3)  little  is  known  about  distrust  and  its  role  in  social  media 
applications,  how  can  distrust  help  make  difference  in  social  media  applications? 

The  chief  objective  of  this  dissertation  is  to  figure  out  solutions  to  these  challenges  via  innovative  research  and  novel 
methods.  In  particular,  computational  tasks  are  designed  to  {\it  understand  distrust},  a  innovative  task,  i.e.,  {\it 
predicting  distrust}  is  proposed  with  novel  frameworks  to  make  invisible  distrust  visible,  and  principled  approaches 
are  develop  to  {\it  apply  distrust}  in  social  media  applications.  Since  distrust  is  a  special  type  of  negative  links,  I 
demonstrate  the  generalization  of  properties  and  algorithms  of  distrust  to  negative  links,  i.e.,  {\it  generalizing 
findings  of  distrust} ,  which  greatly  expands  the  boundaries  of  research  of  distrust  and  largely  broadens  its 
applications  in  social  media. 
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ABSTRACT 


A  myriad  of  social  media  services  are  emerging  in  recent  years  that  allow  people  to 
communicate  and  express  themselves  conveniently  and  easily.  The  pervasive  use  of 
social  media  generates  massive  data  at  an  unprecedented  rate.  It  becomes  increasingly 
difficult  for  online  users  to  find  relevant  information  or,  in  other  words,  exacerbates  the 
information  overload  problem.  Meanwhile,  users  in  social  media  can  be  both  passive 
content  consumers  and  active  content  producers,  causing  the  quality  of  user-generated 
content  can  vary  dramatically  from  excellence  to  abuse  or  spam,  which  results  in  a 
problem  of  information  credibility.  Trust,  providing  evidence  about  with  whom  users 
can  trust  to  share  information  and  from  whom  users  can  accept  information  without 
additional  verification,  plays  a  crucial  role  in  helping  online  users  collect  relevant  and 
reliable  information.  It  has  been  proven  to  be  an  effective  way  to  mitigate  information 
overload  and  credibility  problems  and  has  attracted  increasing  attention. 

As  the  conceptual  counterpart  of  trust,  distrust  could  be  as  important  as  trust  and 
its  value  has  been  widely  recognized  by  social  sciences  in  the  physical  world.  However, 
little  attention  is  paid  on  distrust  in  social  media.  Social  media  differs  from  the 
physical  world  -  (1)  its  data  is  passively  observed,  large-scale,  incomplete,  noisy  and 
embedded  with  rich  heterogeneous  sources;  and  (2)  distrust  is  generally  unavailable 
in  social  media.  These  unique  properties  of  social  media  present  novel  challenges  for 
computing  distrust  in  social  media:  (1)  passively  observed  social  media  data  does 
not  provide  necessary  information  social  scientists  use  to  understand  distrust,  how 
can  I  understand  distrust  in  social  media?  (2)  distrust  is  usually  invisible  in  social 
media,  how  can  I  make  invisible  distrust  visible  by  leveraging  unique  properties  of 
social  media  data?  and  (3)  little  is  known  about  distrust  and  its  role  in  social  media 
applications,  how  can  distrust  help  make  difference  in  social  media  applications? 

The  chief  objective  of  this  dissertation  is  to  figure  out  solutions  to  these  challenges 


via  innovative  research  and  novel  methods.  In  particular,  computational  tasks  are 
designed  to  understand  distrust ,  a  innovative  task,  i.e.,  predicting  distrust  is  proposed 
with  novel  frameworks  to  make  invisible  distrust  visible,  and  principled  approaches 
are  develop  to  apply  distrust  in  social  media  applications.  Since  distrust  is  a  special 
type  of  negative  links,  I  demonstrate  the  generalization  of  properties  and  algorithms  of 
distrust  to  negative  links,  i.e.,  generalizing  findings  of  distrust,  which  greatly  expands 
the  boundaries  of  research  of  distrust  and  largely  broadens  its  applications  in  social 
media. 
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Chapter  1 


INTRODUCTION 


Social  media  greatly  enables  people  to  participate  in  online  activities  and  shatters 
the  barrier  for  online  users  to  create  and  share  information  in  any  place  at  any  time. 
The  pervasive  use  of  social  media  generates  massive  data  in  an  unprecedented  rate  and 
the  information  overload  problem  becomes  increasingly  severe  for  social  media  users. 
Meanwhile  the  quality  of  user-generated  content  can  vary  dramatically  from  excellence 
content  to  abuse  or  spam,  resulting  in  a  problem  of  information  credibility  [102,  9]. 
The  study  and  understanding  of  trust  can  lead  to  an  effective  approach  to  address  both 
information  overload  and  credibility  problems.  Trust  refers  to  a  relationship  between 
a  trustor  (the  subject  that  trusts  a  target  entity)  and  a  trustee  (the  entity  that  is 
trusted)  [22],  In  the  context  of  social  media,  trust  provides  evidence  about  with  whom 
we  can  trust  to  share  information  and  from  whom  we  can  accept  information  without 
additional  verification.  With  trust,  we  make  the  mental  shortcut  by  directly  seeking 
information  from  trustees  or  trusted  entities,  which  serves  a  two-fold  purpose:  without 
being  overwhelmed  by  excessive  information  (i.e. ,  mitigated  information  overload)  and 
with  credible  information  clue  to  the  trust  placed  on  the  information  provider  (i.e., 
increased  information  credibility).  Therefore,  trust  is  crucial  in  helping  social  media 
users  collect  relevant  and  reliable  information,  and  computing  trust  in  social  media 
has  attracted  increasing  attention  in  recent  years  [87]. 

Comparing  with  trust,  little  attention  is  paid  to  distrust  in  social  media.  Research 
with  only  trust  may  be  biased  without  considering  distrust  [23,  83].  Typically,  trust 
relations  can  be  represented  as  a  trust  network  and  the  adjacency  matrix  is  adopted 
to  represent  the  trust  network  where  “0”  is  used  to  indicate  no  trust  as  illustrated 
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(B)  The  Adjacency  Matrix 


Network  in  (A) 

Network  in  (C) 

a 

0.183 

0.223 

b 

0.184 

0.213 

c 

0.263 

0.223 

d 

0.184 

0.171 

e 

0.186 

0.171 

(C)  A  Distrust  Link  from  a  to  d 


(D)  Status  Scores 


Figure  1.1:  The  Impact  of  Distrust. 


in  Figure  1.1  (A)  and  (B).  However,  this  representation  may  not  be  representative 
since  a  zero  score  cannot  distinguish  between  distrust  and  no  trust.  For  example, 
a  distrust  relation  may  exist  from  node  a  to  node  d  as  shown  in  Figure  1.1  (C). 
Furthermore,  ignoring  distrust  in  online  applications  may  lead  to  over-estimation  of 
the  effects  of  trust  [93].  The  first  column  of  Table  (D)  in  Figure  1.1  shows  reputation 
scores,  calculated  using  PageRank  [61],  of  nodes  in  the  network  of  Figure  1.1  (A); 
while  the  second  column  shows  reputation  scores,  calculated  by  a  variant  of  Pagerank 
taking  into  account  distrust  [93],  of  nodes  in  the  network  of  Figure  1.1  (C).  The  only 
difference  between  networks  in  Figure  1.1  (A)  and  (C)  is  a  distrust  relation  from  a  to 
d  in  the  network  (C);  clearly,  the  small  difference  significantly  affects  the  statuses  of 
the  nodes.  These  findings  suggest  that  distrust  could  be  as  important  as  trust. 
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1 . 1  Research  Challenges 


Social  scientists  have  studied  distrust  in  the  physical  world  in  reducing  uncertainty 
and  vulnerability  associated  with  critical  decisions  [63,  3].  Social  media  differs  from 
the  physical  world  and  computing  distrust  in  social  media  face  challenges: 

•  Social  scientists  understand  distrust  from  the  perspectives  of  formation  mecha¬ 
nisms  and  constructs  [40,  12].  One  understanding  is  that  distrust  is  the  negation 
of  trust  while  an  alternative  understanding  is  that  distrust  has  added  value  over 
trust.  There  is  still  no  consensus  about  the  understanding  of  distrust  in  social 
sciences  [63,  3,  29].  However,  understanding  distrust  with  social  media  data  is 
inherently  difficult.  Social  media  data  is  based  on  passive  observations  with  a 
large  number  of  online  users  and  lacks  of  necessary  information  social  scientists 
use  to  study  distrust  where  interactions  with  users  are  required.  Since  methods 
from  social  sciences  are  not  directly  applicable,  the  first  challenge  is  how  we  can 
understand  distrust  in  social  media. 

•  It  is  suggested  in  research  [37,  24]  that  trust  is  a  desired  property  while  distrust 
is  an  unwanted  one  for  an  online  social  community.  Therefore,  various  online 
services  such  as  Ciao1,  eBay2  and  Epinions3  implement  trust  mechanisms  to 
help  users  to  better  use  their  services,  but  few  of  them  allow  online  users  to 
specify  distrust  relations.  Since  distrust  is  usually  unavailable  in  social  media, 
the  second  challenge  is  how  we  can  make  unavailable  distrust  available  in  social 
media? 

•  An  ultimate  assessment  of  the  utility  of  distrust  is  its  impact  on  real-world 

ffittp:/ /www.  ciao.  co.uk/ 

2http:/ /www.  ebay,  com/ 

3http:/ /www. epinions. com/ 
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applications.  We  observe  successful  applications  of  trust  such  as  node  classi¬ 
fication  [65,  19]  and  social  recommendation  [82],  Social  theories  such  as  ho- 
mophily  [56]  and  social  influence  [50]  serve  as  the  groundwork  in  many  trust 
applications;  but  they  may  not  applicable  to  distrust  [83].  Since  we  know  too 
little  about  distrust  and  applying  distrust  may  not  be  carried  out  by  simply 
extending  these  of  trust,  the  third  challenge  is  how  we  can  apply  distrust  in 
social  media  applications? 


1.2  Contributions 

The  aforementioned  challenges  present  a  series  of  interesting  research  questions  - 

(1)  is  distrust  the  negation  of  trust?  and  does  distrust  has  added  value  over  trust? 

(2)  can  invisible  distrust  be  discovered?  (3)  how  can  distrust  help  make  difference 
in  social  media  applications?  and  (4)  is  research  of  distrust  generalizable  to  negative 
links  in  social  media?  One  of  the  chief  objectives  of  this  dissertation  is  to  figure 
out  answers  to  these  questions  via  innovative  research.  The  contributions  of  this 
dissertation  are  summarized  as: 

•  The  unique  properties  of  social  media  determines  that  innovative  methods 
should  be  developed  in  order  to  understand  distrust.  We  design  two  compu¬ 
tational  tasks  by  leveraging  data  mining  and  machine  learning  techniques  to 
enable  the  computational  understanding  of  distrust  in  social  media.  The  first 
task  is  to  predict  distrust  from  only  trust,  which  is  designed  to  seek  an  answer 
for  the  question  of  “is  distrust  the  negation  of  trust?” ;  and  the  second  task  is  to 
predict  trust  with  information  from  distrust,  which  is  designed  for  the  question 
of  “does  distrust  have  added  value  over  trust?” 

•  We  propose  a  new  research  task,  i.e.,  predicting  distrust ,  which  aims  to  auto- 
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matically  predict  distrust  when  distrust  is  unavailable  by  leveraging  available 
sources  in  social  media.  We  make  a  number  of  important  findings  about  distrust 
and  develop  an  unsupervised  framework  dTrust  and  a  supervised  framework 
NeLP,  which  can  predict  distrust  accurately  by  using  trust  and  content-centric 
user  interactions. 

•  We  provide  principled  approaches  to  exploit  distrust  for  social  media  applica¬ 
tions,  i.e.,  applying  distrust.  In  detail,  we  use  node  classification  and  recom¬ 
mendation  as  two  representative  applications  to  illustrate  the  importance  and 
the  added  value  of  distrust  in  social  media  applications. 

•  We  generalize  research  about  distrust  to  negative  links,  i.e.,  generalizing  findings 
of  distrust.  In  particular,  we  find  that  properties  and  algorithms  of  distrust  can 
be  generalized  to  negative  links,  which  greatly  expands  the  boundaries  of  the 
research  of  computing  distrust  and  broadens  its  applications. 

1.3  Organization 

The  remainder  of  this  dissertation  is  organized  as  follows.  In  Chapter  2,  we  in¬ 
troduce  some  basic  concepts,  our  research  on  computing  trust  in  social  media,  and 
background  of  distrust  in  social  science.  In  Chapter  3,  we  first  introduce  possi¬ 
ble  representations  of  distrust,  then  investigate  properties  of  distrust  and  detail  two 
computational  tasks  to  seek  answers  for  the  questions  of  “is  distrust  the  negation  of 
trust?”  and  “does  distrust  have  added  value  over  trust?” .  In  Chapter  4,  we  formally 
define  the  problem  of  distrust  prediction,  perform  analysis  on  distrust  and  introduce 
the  details  about  the  unsupervised  framework  dTrust  and  the  supervised  framework 
NeLP.  In  Chapter  5,  we  study  two  representative  applications  of  distrust  with  a  frame¬ 
work  NCSSN  for  node  classification  and  a  framework  RecSSN  for  recommendation.  In 
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Chapter  6,  we  investigate  the  generalization  of  properties  of  distrust  to  negative  links, 
expand  dTrust  and  NeLP  to  predict  negative  links  and  apply  application  frameworks 
NCSSN  and  RecSSN  for  negative  links.  We  conclude  the  dissertation  and  point  out 
broader  impacts  and  promising  research  directions  in  Chapter  7. 
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Chapter  2 


FOUNDATIONS  AND  PRELIMINARIES 


Before  studying  distrust  in  social  media,  we  have  prepared  ourselves  with  research 
on  computing  trust  in  social  media.  In  this  section,  we  will  briefly  introduce  our 
research  on  computing  trust  in  social  media  first,  and  then  give  background  about 
distrust  research  in  social  sciences. 

2.1  Computing  Trust  in  Social  Media 

There  are  three  major  computational  tasks  for  trust  -  representing  trust,  measur¬ 
ing  trust  and  applying  trust  [87,  89].  Below  we  introduce  the  research  we  have  done 
for  computing  trust  in  social  media. 

Representing  Trust:  It  aims  to  represent  trust  relations  among  users  and  trust 
representations  can  be  roughly  divided  into  probabilistic  representations  and  gradual 
representations  [94],  Traditional  trust  representation  considers  trust  as  a  single  con¬ 
cept,  however,  trust  is  a  complex  concept  with  multiple  dimensions.  Therefore,  we 
propose  two  multi-dimensional  trust  representations  -  mTrust  [78]  and  eTrust  [79,  75]. 
Trust  is  context  dependent  and  trusting  someone  on  one  topic  does  not  necessarily 
mean  he  will  be  trusted  on  others,  hence,  mTrust  is  proposed  to  capture  multi-faceted 
trust,  i.e.,  trust  relations  under  different  contexts.  As  humans  interact,  trust  evolves 
and  eTrust  is  proposed  to  capture  trust  evolution. 

Measuring  Trust:  It  measures  how  much  a  certain  user  can  be  trusted  by  other 
users  from  the  community.  From  different  perspectives,  trust  metrics  can  be  classified 
differently  [87].  From  a  personalization  perspective,  trust  metrics  can  be  classified 
as  global  [30]  and  local  trust  metrics  [21].  From  a  methodology  perspective,  trust 
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metrics  can  be  supervised  [59]  or  unsupervised  [108].  From  a  network  perspective, 
trust  metrics  can  be  binary  or  continuous  [87].  A  few  factors  can  influence  people 
to  establish  trust  relations  and  a  user  usually  establishes  trust  relations  with  a  small 
proportion  of  users  in  the  trust  network,  resulting  in  the  adjacent  matrix  very  sparse 
and  low-rank,  hence  users  can  have  a  more  compact  but  accurate  representation  in 
a  low-rank  space.  hTrust  is  an  unsupervised  trust  measurement  framework  based  on 
low-rank  matrix  factorization  [77].  Online  trust  relations  follow  a  power  law  distri¬ 
bution,  suggesting  that  a  small  number  of  users  specify  many  trust  relations  while  a 
large  proportion  of  users  specify  a  few  trust  relations.  The  power  law  distribution  in¬ 
dicates  that  the  available  trust  relations  may  not  be  enough  to  guarantee  the  success 
of  existing  trust  measurements.  On  the  other  hand,  there  are  many  theories  devel¬ 
oped  to  explain  the  formation  of  trust  such  as  homophily  [56]  and  status  theory  [39] , 
and  these  social  theories  may  be  helpful  to  mitigate  the  data  sparsity  problem.  We 
propose  hTrust  [77]  and  sTrust  [97]  to  exploit  homophily  effect  and  status  theory  to 
improve  trust  measurements  for  users  with  few  or  no  trust  relations.  User  preferences 
may  evolve  over  time  and  we  also  consider  temporal  dynamics  in  trust  metrics  [7]. 

Applying  Trust:  It  aims  to  incorporate  trust  to  help  social  media  applications. 
Recommendation  is  one  of  the  most  popular  and  important  applications  of  trust  [53, 
82,  2],  Existing  trust-aware  recommendation  systems  can  be  divided  into  memory- 
based  methods  [21,  27]  and  model-based  methods  [46,  48].  In  the  physical  world, 
people  seek  recommendations  from  their  trusted  friends  and  they  are  also  likely  to 
accept  recommendations  from  trustworthy  users.  Therefore  we  proposed  LOCABEL 
to  exploit  local  and  global  trust  for  recommendation  [81].  Users  may  think  reviews 
from  their  trusted  users  more  useful  and  CAP  is  proposed  to  apply  trust  in  review 
recommendation  in  online  review  websites  such  as  eBay  [76].  An  comprehensive 
overview  about  trust-aware  reconmicnder  systems  can  be  found  in  our  survey  paper 


in  [82], 


2.2  Distrust  in  Social  Sciences 

In  social  sciences,  the  conceptual  counterpart  of  trust,  distrust,  is  considered  as 
important  and  complex  as  trust  [54,  37,  24,  13].  For  example,  [68,  12]  claim  that 
trust  and  distrust  help  a  decision  maker  reduce  uncertainty  and  vulnerability  (i.e., 
risk)  associated  with  decision  consequences;  and  [13]  indicates  that  only  distrust 
can  irrevocably  exclude  services  from  being  selected  at  all.  There  is  an  enduring 
problem  about  distrust  -  what  is  the  relation  between  trust  and  distrust.  Answering 
this  question  has  its  significance.  If  trust  and  distrust  are  the  same,  lack  of  distrust 
research  matters  little,  however,  if  they  are  different,  the  lack  of  distrust  research  could 
be  problematic  because  distrust  may  have  unique  impact.  Some  researchers  believe 
distrust  simply  means  a  low  level  of  trust,  hence  evidence  of  high  trust  was  always 
regarded  as  being  that  of  low  distrust,  and  outcomes  of  high  trust  would  be  identical  to 
those  of  low  distrust  [63,  3,  29].  Others  believe  distrust  is  a  concept  entirely  separate 
from  trust  [40,  36].  Therefore  distrust  and  trust  can  coexist,  and  they  have  different 
antecedents  and  consequents  [55].  For  example,  in  [40],  three  reasons  are  proposed 
to  prove  that  trust  and  distrust  are  separate  -  (1)  they  separate  empirically;  (2)  they 
coexist;  and  (3)  they  have  different  antecedents  and  consequents.  There  is  still  no 
consensus  answer  about  this  problem,  and  some  social  scientists  consider  distrust  as 
the  “darker”  side  of  trust  [51]. 
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Chapter  3 


UNDERSTANDING  DISTRUST  IN  SOCIAL  MEDIA 


Social  scientists  understand  distrust  from  the  perspectives  of  formation  mech¬ 
anisms  and  constructs  [40,  12]  and  have  recognized  that  distrust  helps  a  decision 
maker  reduce  the  uncertainty  and  vulnerability  (i.e.,  risk)  associated  with  decision 
consequences  [12],  and  in  some  cases,  plays  a  more  critical  role  than  trust  [68,  54], 
Large-scale  social  media  data  does  not  contain  necessary  information  social  scientists 
ascribe  in  their  studying  distrust.  In  social  media,  distrust  is  embedded  with  rich  but 
passively  observed  user-generated  content  and  interactions.  Understanding  distrust 
in  social  media  requires  new  methods  since  those  from  social  sciences  are  not  directly 
applicable,  but  can  be  helpful  in  our  search  of  new  methods.  More  specifically,  we  aim 
to  answer  the  following  research  questions:  (1)  what  are  the  properties  of  distrust? 
(2)  is  distrust  the  negation  of  trust?  and  (3)  does  distrust  have  added  value  over 
trust? 

3.1  Representing  Distrust 

Computational  models  for  trust  depends  on  certain  trust  representations  [87]; 
hence  an  immediate  question  for  distrust  is  how  to  represent  distrust.  We  propose  to 
represent  distrust  with  trust,  which  can  not  only  avoid  possible  biases  because  of  the 
ignorance  of  trust  in  distrust  representations  but  also  help  us  better  understand  the 
role  and  added  value  of  distrust  over  trust  in  applications. 

Distrust  representations  are  substantially  different  with  different  understandings 
about  distrust  over  trust.  If  distrust  is  considered  as  the  negation  of  trust  [63,  3], 
high  (or  low)  trust  would  be  identical  to  those  of  low  (or  high)  distrust  [29].  In  this 


10 


(1,0) 


O  °'8  o 


•  08  • 

l  : 

•  ^  » 


O  |0'8,0I(» 


(a)  Weighted  Unsigned 
Network 


(b)  Signed  Network 


(c)  Two-dimensional 
Network 


Figure  3.1:  Network  Understandings  of  Representations  of  Trust  and  Distrust. 


case,  we  represent  trust  and  distrust  are  two  ends  of  the  same  conceptual  spectrum. 
If  distrust  is  not  the  negation  of  trust  [40,  36],  there  are  two  views  about  the  relations 
between  trust  and  distrust  as: 

•  Trust  and  distrust  are  viewed  as  tightly  related  features  in  a  single  structure  [38] . 
Hence  we  add  positive  and  negative  signs  to  represent  trust  and  distrust  respec¬ 
tively,  and  we  keep  the  semantics  of  a  zero  score  in  the  representation;  and 

•  Distrust  is  viewed  as  a  distinct  dimension  from  trust  about  users  [69].  Hence 
we  add  a  new  dimension  about  users  to  represent  distrust. 

To  further  understand  aforementioned  three  representations  better,  we  show  these 
representations  from  the  network  perspective  as  demonstrated  in  Figure  3.1.  When 
we  consider  distrust  as  low  trust,  the  trust  and  distrust  network  is  a  weighted  unsigned 
network  as  shown  in  Figure  3.1(a);  when  we  add  signs  to  represent  trust  and  distrust, 
the  resulting  network  is  a  weighted  signed  network  as  shown  in  Figure  3.1(b);  while 
we  add  a  new  dimension  to  represent  distrust,  the  trust  and  distrust  network  is  a 
weighted  multi-dimensional  unsigned  network  as  shown  in  Figure  3.1(c). 

Let  U  =  {wi,  U2, . .  ■ ,  un }  be  the  sets  of  users  where  n  is  the  number  of  users.  If  we 
consider  trust  and  distrust  links  as  tightly  related  features  in  a  single  network.  We 
represent  both  trust  and  distrust  links  into  one  adjacency  matrix  F  6  Mnxn  where 
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F ij  =  1,  F,j  =  —  1  and  F^  =  0  denote  trust,  distrust  and  missing  links  from  rq  to 
Uj,  respectively.  If  we  tread  distrust  as  a  distinct  dimension  from  trust  about  users, 
we  use  two  adjacency  matrices  to  represent  trust  and  distrust  links,  respectively.  In 
particular,  it  uses  T  e  Mnxn  to  represent  trust  links  where  T ^  =  1  and  =  0 
denote  a  trust  link  and  a  missing  link  from  Ui  to  Uj.  Similarly  D  e  Mnxn  is  used 
to  represent  distrust  links  where  =  1  and  =  0  denote  a  distrust  link  and  a 
missing  link  from  Ui  to  Uj. 

It  is  easy  to  convert  one  representation  into  the  other  representation  with  the 
following  rules:  F  =  T  — D,  and  T  =  —  and  D  =  —  where  |F|  is  the  absolution 

of  F. 


3.2  Properties  of  Distrust 

Properties  of  trust  such  as  transitivity,  asymmetry  and  homophily  help  determine 
the  value  of  trust [20,  77].  Hence,  to  understand  distrust,  it  is  natural  to  start  with 
exploring  properties  of  distrust.  Naturally,  we  question  if  there  are  some  obvious 
connections  between  these  properties  of  trust  to  distrust.  The  proposed  research  can 
help  us  understand  how  unique  distrust  is  and  the  intrinsic  differences  between  trust 
and  distrust. 

Before  investigations,  we  first  introduce  the  dataset  we  used.  Trust  mechanisms 
are  implemented  by  various  online  services;  however,  few  of  them  allow  users  to 
establish  distrust  relations.  Although  the  product  review  site  Epinions  allows  users 
to  trust  and  distrust  other  users,  distrust  relations  are  unavailable  to  the  public.  For 
the  research  purpose,  a  dataset  with  distrust  relations  was  given  by  Epinions  staff.  We 
preprocess  the  data  by  filtering  users  without  any  trust  and  distrust  relations.  This 
dataset  includes  trust  and  distrust  relations,  user-item  ratings,  user-review  authorship 
relations  and  user-review  helpfulness  ratings.  The  statistics  of  the  dataset  are  shown 
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#  of  Users 

30,455 

#  of  Trust  Relations 

363,773 

#  of  Distrust  Relations 

46,196 

#  of  Users  Receiving  Distrust 

9,513 

#  of  Users  Creating  Distrust 

5,324 

#  of  Items 

89,270 

#  of  Ratings 

562,355 

#  of  Reviews 

1,197,816 

#  of  Helpfulness  Ratings 

10,341,893 

Avg  of  Helpfulness  Rating  Score 

4.7129 

Avg  of  Rating  Score 

3.9053 

Table  3.1:  Statistics  of  the  Ep inions. 

in  Table  3.1. 


3.2.1  Transitivity 

Transitivity  is  a  primary  property  of  trust  and  it  describes  that  trust  can  be  passed 
between  people  [29,  20].  For  example,  if  user  Ui  trusts  user  u3 ,  and  user  Uj  trusts 
user  Uk,  then  transitivity  indicates  that  with  a  high  probability,  user  will  trust  user 
Uk-  Here  we  study  the  property  of  distrust  with  respect  to  transitivity.  Note  that  we 
use  x+y,  x-y,  and  x?y  to  denote  the  observations  of  a  trust,  a  distrust  and  a  missing 
relation  from  user  x  to  user  y,  respectively. 

To  investigate  the  transitivity  property  of  distrust,  we  first  find  all  pairs  of  relations 
(ui-Uj ,Uj-Uk) ,  and  check  whether  Ui  and  Uk  are  with  a  trust  (ui+Uk),  or  a  distrust 
(■ Ui-Uk ),  or  a  missing  relation  ( u{luk ).  We  conduct  a  similar  process  for  trust,  and 
the  results  are  demonstrated  in  Table  3.2.  For  the  first  calculation,  we  consider  all 
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(ui,Uk)  pairs  (i.e.,  Ui+Uk,  Ui-u k,  or  Uihik)  and  use  “PI”  to  denote  the  percentage  of 
pairs  of  (  Uj,Uk)  with  a  trust,  a  distrust  or  a  missing  relation  over  all  pairs.  For  the 
second  calculation,  we  only  consider (ui,Uk)  pairs  with  observed  relations  (i.e.,  trust 
Ui+Uk  or  distrust  u^-Uk),  and  adopt  “P2”  to  represent  the  percentage  of  (  Ui,Uk)  with 
a  trust  or  a  distrust  relation  over  pairs  with  observed  relations  (i.e.,  Ui+Uk  or  Ui-Uk )• 

Golbeck  suggests  that  trust  is  not  perfectly  transitive  in  the  mathematical  sense 
and  is  conditionally  transitive  [20],  which  is  especially  true  for  social  media  data  since 
users  in  social  media  are  world-widely  distributed  and  there  are  many  pairs  of  user 
not  knowing  each  other.  A  trust  relation  Ui+Uk  only  takes  11.46%  (PI)  of  all  pairs  of 
(■ Ui,Uk  )  for  trust.  However,  among  pairs  with  observed  relations,  Ui+Uk  takes  as  high 
as  97.75%  (P2),  which  suggests  the  transitivity  property  for  trust  -  if  u*  establishes 
a  relation  with  it  is  likely  to  be  a  trust  relation.  For  distrust,  the  percentages  of 
Ui-Uk  and  Ui+Uk  are  comparable.  urUk  suggests  transitivity,  which  can  be  explained 
by  status  theory;  while  u^+Uk  can  be  explained  by  balance  theory  [8,  25]  as  “the 
enemy  of  your  enemy  is  your  friend.” 

3.2.2  Asymmetry 

The  asymmetry  of  trust  is  also  important  and  suggests  that  for  two  people  involved 
in  a  relation,  trust  is  not  necessarily  identical  in  both  directions  [22],  For  example, 
if  Ui  trusts  Uj,  one  cannot  infer  that  u3  trusts  Uj.  Next  we  examine  the  property  of 
distrust  in  term  of  asymmetry. 

For  each  trust  relation  Ui+Uj  (or  each  distrust  relation  UrUj),  we  check  the  pos¬ 
sible  relations  from  Uj  to  u*,  and  the  results  are  illustrated  in  Table  3.3.  Note  that 
in  Table  3.3  the  numbers  in  parentheses  are  the  percentages  of  the  corresponding 
relations  over  all  possible  relations.  We  observe  37.71%  mutual  trust  relations,  but 
only  5.86%  mutual  distrust  relations.  These  results  suggest  that  trust  is  asymmetric, 
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Trust 


Types 

Number 

PI 

P2 

25,584,525 

88.34% 

N.A 

(lli+Uj^j+Uk),  Ui+Uk 

3,320,991 

11.46% 

97.75% 

76,613 

0.2% 

2.25% 

Distrust 

Types 

Number 

PI 

P2 

(Ui-UjjUj-ZLfcj)  UiUzLfz 

716,340 

91.70% 

N.A 

(zii-Uj)Uj-Uk^i  Ui~\~Uk 

38,729 

4.96% 

59.73% 

(Ui-tljjUj-Uhj)  Ui~Uk 

26,114 

3.34% 

40.27% 

Table  3.2:  Transitivity  of  Trust  and  Distrust. 

Uj+Ui(%) 

Uj-Ui(% ) 

Ujlh 

*(%) 

Ui+Uj  136,806(37.61) 

967(0.27) 

226,000(62.13) 

Ui-Uj  967(2.09) 

2,623(5.86) 

42,606(92.23) 

Table  3.3:  Asymmetry  of  Trust  and  Distrust. 

and  distrust  is  even  more  asymmetric. 

3.2.3  Similarity 

Ziegler  et  al.  [107]  point  out  that  there  is  a  strong  and  significant  correlation 
between  trust  and  similarity  and  users  with  trust  relations  are  more  similar  than 
those  without.  Next  we  investigate  the  relations  between  distrust  and  similarity.  In 
Epinions,  users  can  give  a  score  from  1  to  5  to  rate  various  items  and  we  define  user 
similarities  as  their  rating  similarities  [77].  We  use  the  following  three  measures  to 
compute  the  similarity  between  Ui  and  Uj  as: 
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Cl 

COSINE 

CI-COSINE 

Distrust  (s^) 

0.4994 

0.0105 

0.0142 

Trust  (st) 

0.6792 

0.0157 

0.0166 

Random  Pairs  (s.r) 

0.1247 

0.0027 

0.0032 

milarity  for  Trust  anc 

l  Distrus 

t. 

•  Cl:  the  number  of  common  items  rated  by  both  Ui  and  u3 ; 

•  COSINE:  the  cosine  similarity  between  the  rating  scores  of  Ui  and  Uj  over  all 
items;  and 

•  CI-COSINE:  the  cosine  similarity  between  the  rating  scores  of  Ui  and  u3  over 
their  commonly  rated  items. 

We  calculate  three  similarities  for  each  distrust  relation,  i.e.,  distrust  similarity  ds, 
trust  similarity  ts,  and  random  similarity  rs.  For  example,  for  the  distrust  relation 
Ui-Uj,  indicating  that  Ui  distrusts  Uj,  ds  is  the  rating  similarity  between  Ui  and  Uj ,  ts 
the  similarity  between  Ui  and  a  randomly  chosen  user  who  has  a  trust  relation  with 
Ui ,  and  rs  the  similarity  between  U{  and  a  randomly  chosen  user  without  a  distrust 
relation  with  Uj.  Over  all  distrust  relations,  finally  we  obtain  three  similarity  vectors, 
Sd,  stl  and  sr.  is  the  set  of  all  distrust  similarities  d,si  st  the  set  of  ts,  and  s,r  the 
set  of  rs.  The  means  of  s^,  st  and  sr  are  shown  in  Table  3.4,  respectively.  We  observe 
that  the  means  of  distrust  are  larger  than  those  of  random  but  smaller  than  those  of 
trust  over  all  three  similarity  measures. 

To  see  the  significance,  we  conduct  a  series  of  two-sample  f-test  on  {s^,  sr},  {si;  sr}, 
and  {st,Srf}.  For  two  vectors  {x,  y},  the  null  hypothesis  H0,  and  the  alternative 
hypothesis  Hi  are  defined  as: 

H0  :  x  <=  y  Hi  :  x  >  y.  (3.1) 
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Cl 

COSINE 

CI-COSINE 

H0  :  sd  <=  sr  Hi 

•  Sd  ^  Sr 

9.57e-87 

1.19e-120 

4.88e-45 

H0  '■  st  <=  sr  Hi 

•  Si  Sr 

1.71e-132 

5.83e-157 

3.72e-108 

Hq  :  St  <=  Sd  Hi 

■  St  >  sd 

7.84e-23 

1.99e-19 

9.32e-17 

Table  3.5:  P- values  for  t-test  Results. 

The  null  hypothesis  is  rejected  at  significance  level  a  =  0.01  with  p-values  shown  in 
Table  3.5.  Means  in  Table  3.4  and  evidence  from  t-test  suggest  that  -  (1)  users  with 
distrust  are  likely  to  be  more  similar  than  those  without;  (2)  users  with  trust  are 
likely  to  be  more  similar  than  those  without;  and  (3)  users  with  trust  are  likely  to  be 
more  similar  than  those  with  distrust. 

3.2.4  Discussion 

Aforementioned  empirical  investigations  indicate  that  distrust  is  not  transitive, 
highly  asymmetric  and  neither  similarity  nor  dissimilarity.  Through  this  comparative 
study  on  properties  of  trust  and  distrust,  we  can  conclude  that  (1)  the  properties  of 
trust  cannot  be  extended  to  distrust;  and  (2)  distrust  presents  distinct  properties. 

3.3  Constructing  Distrust  from  Only  Trust 

Some  social  scientists  believe  distrust  as  the  negation  of  trust  -  trust  and  distrust 
are  two  ends  of  the  same  conceptual  spectrum,  and  distrust  can  be  suggested  by  low 
trust  [63,  3].  To  seek  an  answer  to  the  question  of  “is  distrust  the  negation  of  trust?”, 
we  design  the  task  of  constructing  distrust  from  only  trust.  The  reasoning  behind  this 
task  is  if  distrust  is  the  negation  of  trust,  distrust  can  be  suggested  for  pairs  of  users 
with  low  trust  and  we  can  accurately  construct  distrust  from  only  trust.  Therefore  the 
problem  of  constructing  distrust  from  only  trust  boils  down  to  the  problem  of  predicting 


17 


low  trust  with  trust.  Trust  scores  of  pairs  of  users  in  the  same  trust  network  can  be 
computed  via  existing  trust  prediction  algorithms. 

The  general  framework  for  the  task  is  shown  in  Algorithm  1.  The  input  of  the 
framework  includes  trust  T  and  a  trust  predictor  /.  For  a  pair  of  users  without  trust 
(■ Ui,Uj ),  we  use  /  to  predict  a  trust  score  T ^  from  Ui  to  Uj  and  then  suggest  pairs 
with  low  trust  scores  as  distrust.  We  choose  two  representative  trust  predictors  - 
trust  propagation  [23]  and  the  one  we  proposed  in  [77]  based  on  matrix  factorization. 

Algorithm  1  The  framework  of  Task  1  to  predict  distrust  from  only  trust 
Input:  User-user  trust  relation  matrix  T,  and  a  trust  predictor  / 

Output:  Ranking  list  of  pairs  of  users 

1:  for  Each  pair  of  users  without  trust  (ut,Uj)  do 

2:  Predicting  the  trust  score  of  Ty-  from  U{  to  Uj  by  / 

3:  end  for 

4:  Ranking  pairs  of  users  (e.g.,  ( Ui,Uj ))  according  to  Ty-  in  an  ascending  order. 


3.3.1  Trust  Prediction  based  on  Matrix  Factorization 

Trust  has  some  well-known  properties  such  as  transitivity,  asymmetry  and  corre¬ 
lation  with  user  preference  similarity,  which  lay  the  groundwork  for  meaningful  and 
effective  trust  prediction  models.  Let  Uj  G  denote  the  user  preference  vector  of 
Ui .  In  [77],  we  propose  to  model  a  trust  relation  from  ut  to  Uj  as  T ^  =  U,;VUj 
where  V  G  M.dxd  captures  the  correlations  among  use  preferences.  We  can  verify  that 
the  proposed  model  can  capture  several  important  properties  of  trust  such  as  tran¬ 
sitivity,  asymmetry  and  similarity.  For  example,  V  could  be  asymmetric,  therefore 
T ij  =  IRVUj  could  be  unequal  to  =  UjVU]1",  which  captures  the  property  of 
asymmetry.  Let  U  =  {Ui,  UL, . . . ,  Un}  be  the  user  preference  matrix.  U  and  V  can 
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be  obtained  via  solving  the  following  low-rank  matrix  factorization  problem: 

min  ||T-UVUT|||  +  a||U|||  +  /?||V|||  (3.2) 

where  terms  ofa:||U|||,-|-/3||V||p  are  introduced  to  avoid  overfitting.  With  the  learned 
U  and  V,  the  estimated  user-user  trust  relation  matrix  T  is  obtained  as  T  =  UVUT. 

3.3.2  Evaluation 

In  this  subsection,  we  conduct  experiments  to  answer  the  following  question:  is 
distrust  the  negation  of  trust?  To  answer  the  question,  we  check  how  accurately  we 
can  predict  distrust  from  only  trust. 


Experimental  Settings 


We  first  introduce  the  experimental  setting  for  this  evaluation.  At  =  {(«*,  Uj)  |Tjj  = 
1}  is  the  set  of  pairs  of  users  with  trust  relations,  and  Ad  =  {(«*,  Uj)\Dij  =  1}  is 
the  set  of  pairs  of  users  with  distrust  relations.  The  pairs  in  both  At  and  Ad  are 
sorted  in  chronological  order  in  terms  of  the  time  when  they  established  relations. 
We  assume  that  until  time  t,  x%  of  pairs  in  At  establish  trust  relations,  denoted  as 


A?i  and  we  use  AXD  to  denote  pairs  of  users  in  Ad  establishing  distrust  until  time 
t.  x  is  varied  as  {50,  55,  60,  65,  70, 80,  90, 100}.  For  each  x,  we  use  A f.  to  predict  AXD 
from  Nj,  where  N%,  is  the  negation  of  A f.  as  shown  in  Figure  3.2. 

We  follow  a  common  metric  for  trust  evaluation  in  [41,  77]  to  assess  the  prediction 
performance.  In  detail,  each  predictor  ranks  pairs  in  N%,  in  ascending  order  of 
confidence  and  we  take  the  first  \AXD\  pairs  as  the  set  of  predicted  distrust  relations, 
denoting  APD.  Then  the  performance  is  computed  as, 


-  _  I  ^D  1?  -Ap  I 
1  “  \A*»\ 


(3.3) 


where  I  •  I  denotes  the  size  of  a  set. 
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Figure  3.2:  Experimental  Settings  for  Constructing  Distrust  from  Only  Trust. 

Experimental  Results 

The  results  are  shown  in  the  Table  3.6.  “dTP”,  “dMF”,  and  “dTP-MF”  and  “Ran¬ 
dom”  in  the  table  are  defined  as  follows: 

•  dTP:  this  distrust  predictor  obtains  trust  scores  for  pairs  of  users  based  on  trust 
propagation,  and  then  suggests  distrust  relations  for  pairs  with  low  trust  scores; 

•  dMF:  this  distrust  predictor  computes  trust  scores  for  pairs  of  users  based  on 
matrix  fatorization,  and  then  predict  pairs  with  low  trust  scores  as  distrust 
relations; 

•  dTP-MF:  this  distrust  predictor  combines  results  of  dTP  and  dMF  to  infer 
distrust  relations;  and 

•  Random:  this  distrust  predictor  randomly  guesses  pairs  of  users  with  distrust 
relations. 

If  distrust  is  the  negation  of  trust,  low  trust  scores  should  accurately  indicate 
distrust.  However,  we  observe  that  the  performance  of  dTP,  dMF  and  dTP-MF 
is  consistently  worse  than  that  of  the  randomly  guessing  (Random).  These  results 
suggest  that  low  trust  scores  cannot  be  used  to  predict  distrust;  hence  distrust  is  not 
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x(%) 

dTP  (xlO  5) 

dMF(xl0  5) 

dTP-MF(x  10-5) 

Random(xl0  5) 

50 

4.8941 

4.8941 

4.8941 

5.6824 

55 

5.6236 

5.6236 

5.6236 

8.1182 

60 

7.1885 

7.1885 

7.1885 

15.814 

65 

11.985 

11.985 

11.985 

19.717 

70 

13.532 

13.532 

13.532 

18.826 

80 

10.844 

10.844 

10.844 

16.266 

90 

12.720 

12.720 

12.720 

25.457 

100 

14.237 

14.237 

14.237 

29.904 

Table  3.6:  Performance  Comparison  of  Predicting  Distrust  from  Only  Trust. 

the  negation  of  trust.  Social  scientists,  who  support  distrust  not  the  negation  of  trust, 
argue  that  pairs  of  users  with  untrust  can  have  very  low  trust  scores  [51],  which  is 
especially  correct  for  users  in  social  media  since  they  are  world-widely  distributed. 

3.4  Trust  Prediction  with  Information  from  Distrust 

An  alternative  understanding  is  that  distrust  is  not  the  negation  of  trust  and 
it  has  added  value  over  trust  [40,  36].  To  seek  an  answer  to  the  question  of  “does 
distrust  have  added  value  over  trust?”,  we  design  the  task  of  trust  prediction  with 
information  from  distrust.  The  intuition  behind  this  task  is  if  distrust  has  added  value 
over  trust,  distrust  should  provide  extra  information  about  users  and  we  shall  be  able 
to  predict  trust  better  with  distrust.  An  illustration  of  the  problem  of  trust  prediction 
with  information  from  distrust  is  shown  in  Figure  3.3  -  the  input  of  traditional  trust 
prediction  is  only  old  trust  relations;  we  propose  to  also  use  distrust  information 
as  shown  in  the  dashed  box  in  Figure  3.3.  In  [23],  two  strategies  are  investigate 
to  incorporate  distrust  into  trust  propagation:  (1)  one  step  distrust  propagation  - 
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Figure  3.3:  Trust  Prediction  with  Information  from  Distrust. 

first  propagating  trust  for  multiple  steps  and  then  propagating  one-step  distrust;  and 
(2)  multiple  step  distrust  propagation  -  propagating  trust  and  distrust  propagate 
together  for  multiple  steps.  Next  we  will  investigate  how  to  incorporate  distrust  into 
the  matrix  factorization  based  trust  prediction  [77]. 

3.4.1  Matrix  Factorization  based  Trust  Prediction  with  Distrust  Information 

We  view  trust  and  distrust  as  tightly  related  features  in  a  single  structure  (or 
a  signed  network)  and  choose  adding  signs  to  represent  trust  and  distrust  as  F  = 
T  —  D.  The  advantages  of  this  representation  are  two-fold.  First,  we  can  extend 
the  matrix  factorization  based  trust  prediction  to  incorporate  distrust  information  as 
F ij  =  UjVUj.  Second,  by  adding  signs,  trust  and  distrust  relations  are  represented 
as  a  signed  network  and  social  theories  for  signed  networks  such  as  balance  theory 
can  be  used  to  facilitate  the  task.  Below  we  introduce  how  to  model  balance  theory. 

We  use  Sij  to  denote  the  sign  of  the  relation  between  Ui  and  Uj  where  sl3  =  1  (or 
Sij  =  —1)  if  we  observe  a  trust  relation  (or  a  distrust  relation)  between  Ui  and  Uj. 
With  these  notations,  balance  theory  suggests  that  a  triad  (ui,Uj,Uk)  is  balanced  if 
-  (1)  =  1  and  Sjk  =  1,  then  Sik  =  1;  or  (2)  =  —1  and  Sjk  =  —1,  then  =  1- 

For  a  triad  ( Ui,Uj,Uk ),  there  are  four  possible  sign  combinations  A(+,+,+),  B (+,+,- 


22 


A:  (+,+,+) 


B:  (+,+,-) 


C: 


D: 


Figure  3.4:  An  Illustration  of  Balance  Theory. 

)  C(+,-,-)  and  According  to  balance  theory,  only  A(+,+,+)  and  C(+,-,-) 

are  balanced.  We  examine  all  triads  in  the  studied  dataset  and  find  that  more  than 
90%  of  them  are  balanced.  This  result  suggests  that  balance  theory  is  a  principle  to 
understand  the  formation  of  trust  and  distrust  relations. 

For  each  user  ?j,; ,  we  introduce  a  one-dimensional  latent  factor  r*  to  model  balance 
theory,  and  we  further  assume  that  the  relation  between  Ui  and  Uj  due  to  the  effect  of 
balance  theory  is  captured  as  F^  =  r^j.  To  show  that  F,j  =  ryr,-  can  capture  balance 
theory,  we  need  to  prove  that  -  (1)  Case  1:  if  signiF^)  =  1  and  signiF  jk)  =  1,  we 
can  prove  that  sign{F  ik)  =  1;  and  (2)  Case  2:  if  signiF^)  =  —1  and  sign(F jk)  =  —1, 
we  can  prove  that  signiF  ik)  =  1. 

Let  us  first  prove  Case  1.  If  signiFij )  =  1  and  signiF  jk)  =  1,  we  have  signir^j )  = 

1  and  sign{rjrk)  =  1;  by  multiplying  signir^j)  and  sign{rjrk),  we  have  sign(rirjrjrrk)  = 
1.  Since  sign(r |)  =  1,  we  get  signir^k)  =  1,  i.e. ,  sign(Fik)  =  1.  We  can  use  a 
similar  process  to  prove  Case  2.  If  signiFij)  =  —1  and  signiF  jk)  =  —1,  we  have 
signiriTj)  =  —1  and  signer jrrk)  =  — 1;  by  multiplying  sign(riVj)  and  sign(rjrk),  we 
have  sign{rirjrjrk)  =  1.  Since  sign(r |)  =  1,  we  get  sig7i{ri'rk)  =  1,  i.e.,  sign(Fik)  =  1. 

The  proposed  framework  disMF  for  the  problem  of  trust  prediction  with  distrust 
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information  is  to  solve  the  following  optimization  problem, 


min  ||F-UVUT-ArrT|||  +  a||U|||  +  /3||V|||  +  ^||r||2,  (3.4) 

U.V.r 

where  r  =  [r i ,  r'2 , . . .  ,  rn] T;  the  term  || r |||  is  introduced  to  avoid  overfitting;  and 
the  parameter  A  is  used  to  control  the  contributions  from  balance  theory.  A  local 
minimum  of  Eq.  (4.9)  can  be  obtained  through  a  gradient  decent  optimization  method. 

3.4.2  Evaluation 

In  this  subsection,  we  conduct  experiments  to  answer  the  following  question:  does 
distrust  have  added  value  in  trust  prediction?  To  answer  the  question,  we  examine 
whether  the  performance  of  trust  prediction  is  improved  by  exploiting  distrust. 

Experimental  Settings 

Before  going  to  the  detailed  evaluation,  we  first  introduce  the  experimental  setting. 
We  use  O  =  {(iq, Uj)\Tij  ^  1}  to  denote  the  set  of  pairs  of  users  without  trust 
relations.  Assume  that  at  time  t,  x%  of  At  have  established  trust  relations.  We 
choose  these  x%  as  old  trust  relations  A and  the  remaining  1  —  x%  as  new  trust 
relations  A J  to  predict.  We  use  AXD  to  denote  the  subset  of  pairs  in  Ad,  which  have 
established  distrust  before  t.  We  vary  x  as  {50,  55,  60,  65,  70,  80,  90}.  For  each  x, 
we  also  repeat  the  experiments  10  times  and  report  the  average  performance.  The 
experimental  setting  is  illustrated  in  Figure  3.5  where  denotes  the  set  of  pairs 
without  trust  relations  at  time  t. 

For  each  x,  we  use  old  trust  relations  At  and  distrust  relations  AXD  to  predict 
new  trust  relations  A y.  Each  predictor  ranks  pairs  in  Nf  in  decreasing  order  of 
conhdence  and  we  take  the  Erst  \Aq\  pairs  as  the  set  of  predicted  trust  relations, 
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Figure  3.5:  Experimental  Setting  for  Trust  Prediction  with  Distrust  Information. 


denoting  as  A Then  the  performance  is  assessed  as, 
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(3.5) 


We  use  disTP-m  and  disTP-s  to  denote  performing  multiple  steps  and  a  single  step 
distrust  propagation  in  trust  propagation  (TP),  and  their  comparison  results  are 
shown  in  Figure  3.6.  The  comparison  results  of  disMF  over  matrix-factorization 
based  trust  prediction  (MF)  are  demonstrated  in  Figure  3.7.  Note  that  “Random”  in 
figures  denotes  the  performance  of  randomly  guessing. 

Let  us  first  examine  the  performance  comparisons  when  x  =  50,  which  are  high¬ 
lighted  in  Figure  3.6  and  Figure  3.7.  We  make  the  following  observations: 


•  For  the  first  column  results  in  Figure  3.6,  both  disTP-s  and  disTP-m  outperform 
TP.  For  example,  disTP-s  obtains  4.28%  relative  improvement  compared  to  TP. 
disTP-s  and  disTP-m  incorporate  distrust  propagation  into  trust  propagation, 
and  the  improvement  is  from  distrust  propagation.  These  results  support  that 
distrust  can  improve  trust  propagation  and  leads  to  the  performance  gain  in 
trust  prediction.  We  also  note  that  most  of  the  time,  disTP-s  with  one-step 
distrust  propagation  outperforms  disTP-m  with  multiple  step  distrust  propaga¬ 
tion. 
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Figure  3.6:  Performance  Comparison  for  Trust  Propagation  without  and  with  Dis¬ 
trust  Information. 


Figure  3.7:  Performance  Comparison  for  the  Matrix  Factorization  based  Method 
without  and  with  Distrust  Information. 
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•  For  the  first  column  results  in  Figure  3.7,  disMF  obtains  better  performance 
than  MF,  and  gains  8.55%  relative  improvement  over  MF. 

For  other  values  of  x,  we  have  similar  observations  -  distrust  can  improve  the  per¬ 
formance  of  trust  prediction.  With  the  help  of  distrust,  we  can  significantly  improve 
the  performance  of  trust  prediction,  which  suggests  that  distrust  has  added  value  over 
trust. 

3.5  Conclusion 

With  aforementioned  investigations,  we  can  draw  a  computational  understanding 
of  distrust  in  social  media  by  answering  the  three  questions  we  asked  at  the  begin¬ 
ning  of  this  chapter.  First,  the  property  investigation  suggests  that  distrust  presents 
distinct  properties  from  trust  and  we  cannot  extend  properties  of  trust  to  distrust. 
Second,  the  task  of  distrust  prediction  with  only  trust  fails  to  predict  distrust  by  us¬ 
ing  low  trust,  which  indicates  that  low  trust  is  not  equivalent  to  distrust  and  distrust 
is  not  the  negation  of  trust.  Third,  the  trust  prediction  performance  is  significantly 
improved  with  distrust  information,  which  supports  that  distrust  has  added  value 
over  trust. 
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Chapter  4 


PREDICTING  DISTRUST  IN  SOCIAL  MEDIA 


It  is  suggested  in  research  [37,  24]  that  trust  is  a  desired  property  while  distrust  is 
an  unwanted  one  for  an  online  social  community.  Intuitively,  various  online  services 
such  as  Ciao1,  eBay2  and  Epinions3  implement  trust  mechanisms  to  help  users  to  bet¬ 
ter  use  their  services,  but  few  of  them  allow  online  users  to  specify  distrust  relations. 
To  make  use  of  distrust,  we  need  to  make  them  visible  on  social  media  sites  where 
distrust  does  not  explicitly  present.  We  propose  to  predict  distrust  by  mining  social 
media  data.  Before  delving  into  the  discussion  of  distrust  prediction,  we  delineate  its 
differences  from  trust /distrust  prediction  [23]  and  sign  prediction  [101]  as  shown  in 
Figure  4.1.  The  distrust  prediction  problem  in  this  proposal  is  quite  different  from 
the  trust /distrust  prediction  and  sign  prediction  problems  as  follows: 

•  As  shown  in  Figure  4.1a,  the  trust /distrust  prediction  predicts  new  trust  and 
distrust  relations  from  existing  trust  and  distrust  relations.  Our  prediction 
problem,  as  illustrated  in  Figure  4.1c,  assumes  that  distrust  is  not  accessible  in 
data. 

•  The  sign  prediction  problem  as  shown  in  Figure  4.1b  predicts  signs  of  already 
existing  relations.  The  distrust  prediction  problem  needs  to  identify  the  pairs 
of  nodes  between  which  distrust  relations  are  predicted  to  exist. 

1http:/ /www. ciao. co.uk/ 

2http:/ /www.  ebay,  com/ 

3http:/ /www. epinions. com/ 


(a)  Trust /Distrust  Prediction 


(b)  Sign  Prediction 


(c)  Distrust  Prediction 


Figure  4.1:  An  Illustration  of  the  Differences  of  Trust/Distrust  Link  Prediction, 
Sign  Prediction  and  Distrust  Prediction. 


Figure  4.2:  An  Illustration  of  Interaction  Data  in  Social  Media. 

4.1  Problem  Statement 

To  preserve  the  generality  of  our  approach,  it  is  important  to  use  data  which  is 
pervasively  available  across  these  social  trust  systems.  Figure  4.2  demonstrates  a 
typical  data  in  social  trust  systems.  First,  an  obvious  source  of  useful  data  is  trust 
information  which  is  commonly  available  in  most  social  trust  systems.  Second,  in 
most  these  sites,  users  can  create  or  post  user-generated  content  and  other  users  can 
comment,  like/dislike  and  rate  such  content.  For  example,  in  Epinions,  users  can  rate 
the  helpfulness  of  reviews  written  by  others.  In  this  work,  we  study  the  novel  problem 
of  distrust  prediction  from  these  two  pervasive  sources  in  social  trust  systems  -  trust 
information  and  content-centric  interactions. 
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Let  V  =  {pi,P2,  •  •  •  ,Pm}  be  the  set  of  M  pieces  of  user-generated  content  such 
as  posts.  We  use  A  e  MnxM  to  denote  the  user-content  authorships,  where  Pp  = 
1  if  pj  is  created  by  ui:  and  Pp  =  0  otherwise.  Users  can  express  opinions  on 
content  via  comments,  likes/dislikes,  and  ratings.  Some  sites  provide  explicit  ways  of 
enabling  user  feedback  on  content.  Examples  include  likes/dislikes  in  eBay,  and  “very 
helpful”  /  “not  helpful”  ratings  in  Epinions.  Other  more  common  forms  of  feedback 
allow  users  to  express  their  opinions  in  the  form  of  textual  comments  and  replies.  In 
such  cases,  we  adapt  off-the-shelf  opinion  mining  tools  to  extract  user  opinions  from 
such  texts.  We  use  O  e  MnxM  j-Q  represent  the  user-post  opinion  relations  where 
O ij  =  1,  O ij  =  —  1  and  Op  =  0,  if  Ui  expresses  positive,  negative  and  neutral  (or  no) 
opinions,  respectively,  on  pr 

With  the  aforementioned  notations  and  definitions,  the  problem  of  distrust  pre¬ 
diction  with  trust  relations  and  content-centric  user  interactions  is  formally  defined 
as  follows: 

Given  trust  relations  T?  and  content- centric  user  interactions  P  and  O,  we  aim  to 
develop  a  predictor  f  to  predict  distrust  relations  D  with  T,  P  and  O  as, 

/:  {T.  P  O}  — >■  D  (4.1) 

4.2  Data  Analysis 

Because  trust  prediction  is  dependent  on  “typical”  behavior  of  trust  networks  such 
as  transitivity  and  similarity,  it  is  natural  to  explore  similar  properties  of  distrust  with 
respect  to  trust  relations  and  and  content-centric  interactions.  Such  an  understanding 
lays  the  groundwork  for  a  meaningful  distrust  prediction  model.  Note  that  distrust 
relations  in  data  analysis  only  serve  as  a  ground-truth  about  typical  properties  and  the 
underlying  social  theories.  However,  they  will  not  be  explicitly  used  in  the  proposed 
frameworks  for  the  problem  of  distrust  prediction. 
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Figure  4.3:  Ratio  Distribution  of  the  Length  of  Shortest  Path  for  Pairs  with  Distrust 
Relations  in  the  Trust  Network. 

4.2.1  Where  Are  our  “Foes”? 

Our  first  analytical  task  is  to  examine  the  typical  structural  relationships  of  “foes” 
(or  users  with  distrust  relations)  within  the  trust  network.  In  other  words,  if  Ui  has  a 
distrust  relation  to  Uj  in  the  distrust  network  D,  we  investigate  the  typical  position 
of  Uj  with  respect  to  u*  in  the  trust  network  T. 

For  each  distrust  link  Ui-Uj  in  D,  we  use  breadth-first  search  to  compute  the 
shortest  path  from  Ui  to  u3  in  T.  If  paths  exist  from  Ui  to  Uj ,  we  report  the  length 
of  the  shortest  path.  Otherwise  we  report  the  length  as  “inf”  to  indicate  there  is  no 
path  from  Ui  to  Uj  in  T.  The  ratio  distributions  of  the  lengths  of  the  shortest  paths 
for  all  distrust  relations  are  demonstrated  in  Figure  4.3.  More  than  45%  of  distrust 
relations  Ui-Uj  have  shortest  path  lengths  less  than  3,  and  more  than  80%  of  them 
have  shortest  path  lengths  less  than  4.  These  results  suggest  that  our  “foes”  are  often 
close  to  us  in  the  trust  network  T.  For  example,  about  82.64%  of  enemy-pairs  are 
within  3-hops  of  each  other  in  the  trust  network  of  Epinions. 
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4.2.2  Social  Theories  in  Trust /Distrust.  Networks 


We  can  view  trust /distrust  networks  as  signed  networks.  In  this  subsection, 
we  investigate  two  of  the  most  important  social  theories  for  signed  networks  in 
trust /distrust  networks  -  balance  theory  [25]  and  status  theory  [23,  39]. 

For  a  triad,  four  possible  sign  combinations  exist  -  A(+,+,+),  B(+,+,-)  C(+,-,-) 
and  D Among  these  four  combinations,  A  and  C  are  balanced.  The  way  to 
measure  balance  of  trust/distrust  networks  is  to  examine  all  these  triads  and  then 
to  compute  the  ratio  of  A  and  C  over  A,  B,  C  and  D.  We  computed  the  relative 
presence  of  these  four  possible  combinations  and  find  that  92.31%  triads  in  Epinions 
are  balanced. 

While  balance  theory  is  developed  for  undirected  signed  networks,  for  directed 
signed  networks,  status  theory  is  introduced  in  [23,  39].  In  status  theory,  a  trust 
relation  from  Ui  to  Uj  indicates  that  u*  has  a  higher  status  than  Uj\  while  a  distrust 
relation  from  Ui  to  Uj  indicates  that  Ui  has  a  lower  status  than  Uj.  For  a  triad,  status 
theory  suggests  that  if  we  take  each  distrust  relation,  reverse  its  direction,  and  flip  its 
sign  to  trust,  then  the  resulting  triangle  (with  all  trust  relations)  should  be  acyclic. 
We  first  obtain  all  triads  and  then  follow  the  aforementioned  way  to  examine  whether 
these  triads  satisfy  status  theory  or  not.  We  find  that  94.73%  of  triads  in  Epnions 
satisfy  status  theory. 

4.2.3  Distrust  Relations  and  Content-centric  Interactions 

Content-centric  interactions  relate  the  opinion  of  user  tp  on  the  content  posted 
by  user  Uj.  The  user  Ui  can  express  negative  opinions  on  content  posted  by  another 
user  Uj  by  disliking,  giving  negative  comments,  or  negative  ratings.  Such  types  of 
content-centric  interactions  may  be  viewed  as  negative  interactions  between  Ui  and 
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Uj.  A  negative  interaction  from  ut  to  Uj  is  often  a  manifestation  of  user  tq’s  disagree¬ 
ment  and  antagonism  toward  Uj.  It  is  therefore  reasonable  to  surmise  that  negative 
interactions  might  be  correlated  with  distrust  relations.  In  this  subsection,  we  study 
the  correlation  between  negative  interactions  and  distrust  relations. 

Let  N  G  Mnxn  be  a  user-user  negative  interaction  matrix  where  Ny  denotes  the 
number  of  negative  interactions  from  iq  to  Uj.  We  can  obtain  N  from  the  user-content 
authorship  matrix  A  and  the  user-content  opinion  matrix  O  as  N  =  —  A(0~)T  where 
cr  =  is  the  negative  part  of  O.  To  verify  the  correlation  between  negative 

interactions  and  distrust  relations,  we  aim  to  answer  the  following  question:  Are  pairs 
of  users  with  negative  interactions  more  likely  to  have  distrust  relations  than  those 
without  negative  interactions? 

For  each  pair  ( Ui,Uj )  with  negative  interactions  (or  ^  0),  we  first  randomly 
select  a  user  Uk  that  rq  does  not  have  negative  interactions  with  (or  N2J  =  0),  and 
then  use  S  (or  R)  to  indicate  whether  (ui,Uj)  (or  ( ))  has  a  distrust  relation 
where  S  =  1  (or  R  —  1)  if  tq  has  a  distrust  relation  to  Uj  (or  ut  has  a  distrust  relation 
to  Uk),  otherwise  S  =  0  (or  R  =  0).  Let  s  be  a  vector  of  S s  over  all  pairs  of  users 
with  negative  interactions  and  r  be  the  corresponding  vector  of  Rs.  We  conduct  a 
two-sample  f-test  on  s  and  r.  The  null  hypothesis  and  the  alternative  hypothesis  are 
defined  as: 


Ho  :  s  <  r,  H\  :  s  >  r.  (4.2) 

The  null  hypothesis  is  rejected  at  significance  level  a  =  0.01  with  p-value  of  5.72e- 
89  in  Epinions.  Evidence  from  the  f-test  suggests  a  positive  answer  to  the  question: 
there  is  a  strong  correlation  between  negative  interactions  and  distrust  relations,  and 
users  with  negative  interactions  are  likely  to  have  distrust  relations. 

We  further  investigate  the  direct  impact  of  negative  interactions  on  distrust  re- 
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Figure  4.4:  Number  of  Negative  Interactions  and  Distrust  Relations. 


lations.  For  a  given  value  of  K ,  we  calculated  the  ratio  of  pairs  with  both  distrust 
relations  and  at  least  K  negative  interactions  over  all  pairs  with  at  least  K  negative 
interactions.  The  ratio  distributions  with  respect  to  the  number  of  negative  interac¬ 
tions  are  demonstrated  in  Figures  4.4.  Note  that  the  ratios  of  randomly  selected  pairs 
with  distrust  relations  among  all  n{n  —  1)  pairs  of  users  are  2.4177e-04  in  Epinions. 
From  the  figure,  we  note  that  the  ratios  are  much  higher  than  the  random  ones  even 
when  K  is  very  small.  This  observation  further  supports  the  existence  of  the  correla¬ 
tion  between  negative  interactions  and  distrust  relations.  Furthermore  with  increase 
of  K,  the  ratios  tend  to  increase.  Therefore,  an  increase  in  the  number  of  negative 
interactions  increases  the  likelihood  of  distrust  relations  between  users. 

4.2.4  Discussion 

We  summarize  the  insights  obtained  in  the  aforementioned  discussion  as  follows: 

•  Most  of  our  “foes”  are  close  to  us  within  a  few  (e.g.,  2  or  3)  hops  in  the  trust 
network; 
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•  Most  of  triads  in  trust/distrust  networks  satisfy  balance  theory  and  status  the¬ 
ory; 

•  Pairs  of  users  with  negative  interactions  are  more  likely  to  have  distrust  relations 
than  those  without  them;  and 

•  Negative  interactions  between  users  increase  the  propensity  of  distrust  relations. 

These  observations  provide  the  groundwork  for  the  following  proposed  frameworks 
for  distrust  prediction.  Algorithms  for  all  variations  of  trust  prediction  are  either 
unsupervised  methods  [23,  77]  or  supervised  methods  [43,  59].  In  the  following  two 
subsections,  we  will  investigate  distrust  prediction  in  both  unsupervised  [80]  and 
supervised  scenarios  [73]. 

4.3  Unsupervised  Distrust  Prediction 

Traditional  unsupervised  trust  prediction  are  usually  based  on  certain  properties 
of  trust  [87]  such  as  transitivity  [23]  and  low-rank  representation  [77].  Similarly,  we  try 
to  model  findings  and  observations  of  distrust  in  the  last  subsection  for  unsupervised 
distrust  prediction. 


4.3.1  Pseudo  Distrust  Relations 

We  first  divide  all  n(n  —  1)  pairs  of  users  into  three  groups  Q  =  {Gi,  G2,  G3}  as: 

•  Q\  contains  pairs  of  users  with  trust  relations  as: 

Gi  =  =  1};  (4.3) 

•  G-2  is  the  set  of  pairs  of  users  without  trust  relations  but  with  negative  interac¬ 
tions  as: 

G-2  =  {( Ui ,  uj) |T„  =  0  A  Njj  >  0};  (4.4) 
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•  03  includes  the  remaining  pairs  as: 

03  =  { (it* ,  Uj )  | (rt j ,  Uj )  ^  (<7i  U  g2)}.  (4.5) 

Qx  and  02  correspond  to  the  set  of  pairs  of  users  with  trust  relations  and  negative 
interactions,  respectively.  Based  on  these  three  groups,  we  introduce  a  matrix  F  G 
Mnxn  to  represent  user-user  trust  relations  and  pseudo  distrust  relations,  and  the 
entities  of  F  are  defined  as  follows: 

•  For  ( Ui,Uj )  G  0i,  we  set  F?J-  =  1  since  Ui  trusts  Uj ; 

•  For  (u^  Uj)  G  02,  Ui  has  negative  interactions  to  Uj,  and  according  to  the  correla¬ 
tion  between  negative  interactions  and  distrust  relations,  Ui  is  likely  to  distrust 
Uj ;  hence,  we  assign  a  pseudo  distrust  relation  from  iq  to  Uj  by  setting  F^  =  — 1; 

•  We  do  not  have  evidence  of  possible  relations  for  ( Ui,Uj )  G  03,  therefore  we  set 
F  ij  —  0. 

the  entities  of  F  are  formally  defined  as  follows: 

1  if  (ui,  Uj)  G  0i 

-1  if  (ui,  Uj)  G  02  •  (4-6) 

0  if  (ui,Uj)  G  03 

The  values  in  F  may  be  not  equally  reliable.  For  example,  Ftj  for  ( u^Uj )  G  0i 
is  very  reliable  since  we  observe  trust  relations,  while  values  of  pairs  in  02  with 
more  negative  interactions  are  more  reliable  based  on  the  finding  -  the  more  negative 
interactions  two  users  have,  the  more  likely  a  distrust  relation  between  them  exists. 
Therefore,  we  define  a  weight  matrix  W  G  Mnxn  where  W.y-  G  [0, 1]  is  a  weight  to 
indicate  the  reliability  of  Fy-.  Next  we  define  the  weight  matrix  as 

•  We  observe  trust  relations  for  pairs  in  Gi]  hence  for  (iq,  Uj)  G  G i,  we  set  =  If 
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•  Our  previous  finding  reveals  that  the  more  negative  interactions  two  users  have, 
the  more  likely  a  distrust  relation  between  them  exists;  hence  for  ( Ui,Uj )  G  Q2, 
W ij  is  defined  as  a  function  of  the  number  of  negative  interactions  as  W?J-  = 
g(Nij).  The  function  g(x)  has  following  properties  -  (1)  a;  is  a  positive  integer; 
(2)  g(x)  G  [0,1];  and  (3)  g(x)  is  non- decreasing  function  of  x;  and 

•  We  set  W ij  to  be  a  constant  c  G  [0, 1]  for  ( ui:Uj )  G  Q3. 


the  weight  matrix  W  is  formally  defined  as, 


Wy  =  < 


1  if  (wj,  Uj)  G  Q\ 

fi'(Ny)  if  (ui,uj)eG2  ■ 

c  if  {ui,Uj)  G  Qz 

4.3.2  An  Unsupervised  Framework  -  dTrust 


(4.7) 


With  trust  and  pseudo  distrust  relations  F  and  its  weight  matrix  W,  the  problem 
of  distrust  prediction  boils  down  to  a  special  trust  and  distrust  prediction  problem. 
Therefore  we  can  choose  a  representative  trust  and  distrust  prediction  algorithm  as 
our  basic  algorithm.  In  this  work,  we  choose  the  matrix  factorization  based  method 
introduced  in  Section  3.4.1.  However,  we  may  not  apply  it  to  our  problem  directly 
since  the  values  in  F  may  not  be  reliable.  The  proposed  framework  dTrust  is  based 
on  the  new  formulation  with  the  user-user  trust  and  pseudo  distrust  relations  F  and 
its  weight  matrix  W  as, 

n  n 

nun  Y. £  (W«dV  -  U.HUj  -  Ar.r,))2  +  a(||U||J.  +  ||H||J.  +  |r|D,  (4.8) 

i=l  j= 1 

where  the  contribution  of  F^  to  the  learning  process  is  controlled  by  Wy.  A  large 
value  of  Wy ,  indicating  the  high  reliability  of  F?J ,  will  force  U,HUj  to  tightly  fit 
F ij,  while  UjHUj  will  loosely  approximate  F^  when  W.y-  is  small. 
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Eq.  (4.8)  can  be  rewritten  to  its  matrix  form  as 


min  ||W0(F-UHUT-ArrT)|||  +  a(||U||i.+  ||V||'J.+  ||r||l),  (4,9) 

U,H,r 

where  0  is  the  Hadamard  product  where  (X©  Y)y  =  x  Ytj  for  any  two  matrices 
X  and  Y  with  the  same  size.  The  significance  of  the  introduction  of  pseudo  distrust 
relations  in  F  is  three  fold.  First,  it  enables  us  to  predict  distrust  relations  by  incor¬ 
porating  interaction  data.  Second,  it  paves  a  way  to  model  the  correlation  between 
negative  interactions  and  distrust  relations.  Finally  the  introduction  of  pseudo  dis¬ 
trust  relations  enables  us  to  exploit  social  theories  from  signed  networks  with  trust 
and  pseudo  distrust  relations,  while  exploiting  social  theories  in  turn  may  help  us 
mitigate  the  effects  of  unreliable  values  in  F.  and  potentially  improves  the  distrust 
prediction  performance.  The  optimization  problem  in  dTrust  is  solved  by  a  decent 
gradient  method  and  more  details  are  shown  in  Appendix  A. 

4.3.3  Evaluation 

In  this  section,  we  conduct  experiments  to  evaluate  the  effectiveness  of  the  pro¬ 
posed  framework.  In  particular,  we  try  to  answer  two  questions  via  experiments  -  (1) 
can  the  proposed  framework  predict  distrust  information  indirectly  with  interaction 
data?  and  (2)  how  do  the  components  of  dTrust  affect  its  performance?  We  begin  by 
introducing  experimental  settings,  and  then  design  experiments  to  seek  answers  for 
these  questions. 

Experimental  Settings 

Let  A  be  the  set  of  pairs  with  trust  relations  in  the  Epinions  dataset  and  we  sort  A 
in  a  chronological  order  in  terms  of  the  time  when  pairs  established  trust  relations. 
Assume  that  there  are  x%  of  pairs  in  A  establishing  trust  relations  until  time  tx.  For 
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each  x,  we  collect  trust  relations,  distrust  relations,  user-review  authorship  relations 
and  user-review  helpfulness  ratings  until  time  tx  to  form  a  evaluation  dataset  Epin- 
ionsx.  In  this  paper,  we  vary  x  as  {50,  70, 100}  and  correspondingly  we  construct 
three  evaluation  datasets  from  the  Epinions  dataset,  i.e.,  Epinions50,  EpinionslO 
and  Epinions  100.  The  purpose  of  varying  the  values  of  x  is  to  investigate  the  per¬ 
formance  of  the  proposed  framework  on  Epinions  datasets  with  different  statistics. 
Other  experimental  settings  and  evaluation  metric  are  the  same  as  these  in  3.3.2. 

Experimental  Results 

The  comparison  results  are  shown  in  Figure  4.5  and  baseline  methods  in  the  figure 
are  defined  as, 

•  neglnter.  This  method  is  based  on  the  strong  correlation  between  negative  in¬ 
teractions  and  distrust  relations,  neglnter  ranks  pairs  of  users  based  on  the 
numbers  of  negative  interactions.  The  larger  the  number  of  negative  interac¬ 
tions,  the  higher  the  prediction  confidence. 

•  random :  this  predictor  ranks  pairs  of  users  randomly.  [41]  suggests  that  a  ran¬ 
dom  predictor  should  be  used  as  a  baseline  method  to  meaningfully  demonstrate 
the  predictor  quality  since  the  Ml  value  is  usually  low. 

From  Figure  4.5,  we  note  that  the  performance  of  neglnter  is  much  better  than 
that  of  random ,  which  further  demonstrates  the  existence  of  a  strong  correlation  be¬ 
tween  negative  interactions  and  distrust  relations.  We  also  observe  that  the  proposed 
framework  dTrust  always  outperforms  baseline  methods.  Performance  comparison 
between  the  random  predictor  and  the  proposed  framework  dTrust  suggests  that 
dTrust  can  accurately  predict  distrust  relations  by  incorporating  trust  relations  and 
interaction  data. 
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Figure  4.5:  Performance  Comparison  of  Different  Predictors. 


g(x)  controls  the  impact  of  the  number  of  negative  interactions  on  dTrust  and  we 
empirically  set  g{x)  =  1  —  iogl^.+1)  •  Next  we  investigate  the  importance  of  the  number 
of  negative  interactions  by  studying  the  performance  changes  of  dTrust  with  difference 
choices  of  g{x)  as  shown  in  Table  4.1.  Note  that  “random”  in  the  table  denotes  that  we 
randomly  assign  values  in  [0, 1]  to  the  function.  We  make  the  following  observations 

•  When  g(x)  =  0,  we  eliminate  negative  interactions  and  the  performance  re¬ 
duces  dramatically.  This  result  demonstrates  the  importance  of  incorporating 
interaction  data. 

•  Compared  to  the  performance  of  g(x)  =  1  —  the  performance  g(x)  with 

a  non-zero  constant  degrades  a  lot.  These  results  suggest  that  modeling  the 
impact  of  the  number  of  negative  interactions  on  the  correlation  can  improve 
the  performance  of  dTrust. 

•  Compared  to  the  performance  of  g(x)  =  1  —  the  performance  g(x)  with 

random  values  also  reduces  a  lot.  These  results  directly  suggest  that  g(x)  should 
not  be  random  values,  and  further  demonstrate  the  importance  of  modeling  the 
impact  of  the  number  of  negative  interactions  by  W. 
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Epinions50 

Epinions70 

EpinionslOO 

g(x)  =  0 

0.00001 

0.00003 

0.00004 

g{x)  =  1 

0.05812 

0.11686 

0.13039 

g(x)  =  random 

0.05905 

0.11763 

0.13207 

9(x)  =  1  io9(l+ 1) 

0.08737 

0.15054 

0.17391 

Table  4.1:  Difference  Definitions  of  g(x). 

We  can  conclude  that  (1)  g{x)  should  not  be  random  values;  (2)  defining  g{x)  based 
on  the  number  of  negative  interactions  can  significantly  improve  the  performance  of 
dTrust. 


4.4  Supervised  Distrust  Prediction 

Traditional  supervised  methods  consider  trust  prediction  as  as  classification  prob¬ 
lem  [87].  Supervised  methods  could  have  several  advantages  over  unsupervised  meth¬ 
ods  such  as  superior  performance,  adaptability  to  different  data  domains,  and  variance 
reduction  [42] .  In  this  subsection,  we  investigate  how  to  tackle  the  problem  of  distrust 
prediction  with  supervised  learning.  Similar  to  traditional  trust  prediction,  we  can 
consider  distrust  prediction  problem  as  a  classification  problem  where  we  need  to  con¬ 
struct  labels  and  extract  features.  Different  from  traditional  trust  prediction,  there 
are  unique  challenges  in  preparing  training  data  in  the  distrust  prediction  problem. 
For  example,  existing  relations  are  given  in  traditional  trust  prediction  such  as  trust 
relations  in  trust  prediction,  and  trust /distrust  relation  in  trust /distrust  prediction, 
while  existing  distrust  relations  are  not  given  in  the  distrust  prediction  problem. 
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4.4.1  Label  Construction 


Let  En  and  E0  denote  pairs  of  users  with  distrust  relations  and  without  any 
relations,  respectively.  In  most  previous  formulations  of  link  prediction,  including  the 
signed  version,  label  construction  is  trivial  because  the  presence  of  links  is  specified. 
However,  we  study  the  scenario  where  no  distrust  relations  are  provided,  and  therefore 
the  labels  for  En  are  really  an  unspecified  subset  of  E0U  En.  What  is  worse,  the  sizes 
of  En  and  E0  are  extremely  imbalanced.  For  example,  the  imbalance  ratios  En  :  E0 
are  1:4131  in  Epinions.  We  treat  pairs  in  E0  as  positive  samples  and  distrust  relations 
as  negative  samples.  Label  construction  is  to  construct  positive  and  negative  samples 
from  E0  U  En.  Since  the  ratio  of  E0  in  En  U  E0  are  often  bigger  than  99.9%,  we  can 
randomly  select  a  subset  of  samples  from  En  U  E0  as  positive  samples  PS.  Next  we 
introduce  a  way  to  select  samples  from  En  U  E0  as  negative  samples  based  on  previous 
observations.  The  process  is  shown  in  Algorithm  2. 

Next,  we  describe  Algorithm  2  for  negative  sample  construction.  The  strong  cor¬ 
relation  between  negative  interactions  and  distrust  relations  suggests  that  users  with 
negative  interactions  are  likely  to  have  distrust  relations.  Therefore  from  line  2  to 
line  4  in  Algorithm  2,  we  construct  negative  sample  candidates  based  on  this  obser¬ 
vation.  With  the  trust  relations  and  distrust  relations  Ui-Uj  from  NS,  we  construct  a 
signed  network  Q  in  line  5.  Most  of  the  triads  in  signed  networks  satisfy  status  theory. 
Therefore  we  refine  NS  by  (a)  excluding  ( Ui,Uj )  from  NS  if  Ui-Uj  is  in  any  triads  of 
Q  that  does  not  satisfy  status  theory  in  line  6;  and  (b)  adding  samples  ( Ui,Uk )  into 
NS  if  Ui-Uk  can  make  all  triads  that  involve  u*  and  in  Q  satisfying  status  theory  in 
line  7.  The  reliability  of  these  negative  samples  may  vary.  For  example,  observations 
from  data  analysis  indicate  that  negative  sample  candidates  with  more  negative  in¬ 
teractions  are  more  likely  to  have  distrust  links,  and  are  therefore  more  likely  to  be 
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Algorithm  2  Negative  Sample  Construction 

Input  :  The  trust  network  T  and  user-user  negative  interaction  matrix  N 
Output  :  Negative  sample  set  NS  and  the  reliability  weight  matrix  W 
1:  Initialize  NS  =  0 

2:  for  all  N ij  7^  0  do 

3:  NS  =  NSuHm,^)} 

4:  end  for 

5:  Construct  Q  as  a  signed  network  with  trust  relations  from  T  and  distrust  relations 
Ui-Uj  from  NS 

6:  Remove  samples  ( u^Uj )  from  NS  if  Ui-Uj  is  in  any  triads  of  Q  that  does  not 
satisfy  status  theory 

7:  Add  samples  (ui,Uk)  into  NS  if  Ui-Uk  can  make  all  triads  that  involve  u*  and 
in  Q  satisfying  status  theory 
8:  for  all  ( Ui,Uj )  G  NS  do 
9:  Calculate  a  reliability  weight  Wy 

10:  end  for 


reliable.  Therefore,  we  associate  each  (ut ,  Uj )  with  a  reliability  weight  WtJ)  which  is 
defined  as  follows: 


w,  =  ( 9(N«> 

r 


if  Nij  +  0 


otherwise 


(4.10) 


if  the  pair  ( u^Uj )  G  NS  has  negative  interactions,  we  define  the  reliability  weight 
as  a  function  g  of  the  number  of  negative  interactions  Ny.  Otherwise,  the  pair 
(ui,Uj)  G  NS  is  added  by  line  7  in  Algorithm  2  and  we  set  the  reliability  weight  to  a 
constant  r. 


43 


4.4.2  Feature  Extraction 


We  extract  three  types  of  features  corresponding  to  user  features,  pair  features  and 
sign  features.  User  features  and  pair  features  are  extracted  from  two  given  sources, 
such  as  trust  relations  and  content-centric  interactions,  as  follows: 

•  User  features  are  extracted  for  each  user  ut  including  itj’s  indegree  (or  outdegree) 
in  terms  of  trust  relations,  the  number  of  triads  that  u,  involved  in,  the  number 
of  content-centric  items  (e.g.,  posts)  that  Ui  creates,  the  number  of  uf  s  posts 
that  obtain  positive  (or  negative)  opinions,  and  the  number  of  positive  (or 
negative)  opinions  Ui  expresses;  and 

•  Pair  features  are  extracted  for  a  pair  of  users  (ui,Uj)  including  the  number 
of  positive  (or  negative)  interactions  from  Ui  to  Uj,  the  number  of  positive 
(or  negative)  interactions  from  Uj  to  Ui,  Jaccard  coefficients  of  indegree  (or 
outdegree)  of  Ui  and  Uj  in  terms  of  trust  relations,  and  the  length  of  the  shortest 
path  between  and  Uj. 

We  construct  a  weighted  signed  network  with  the  given  trust  relations  and  distrust 
relations  from  NS  where  the  weights  of  trust  relations  are  1  and  the  weights  of 
distrust  relations  are  their  reliability  weights.  For  a  pair  ( Ui,Uj ),  signed  features 
include  weighted  indegree  (or  outdegee)  in  terms  of  trust  relations  of  weighted 
indegree  (and  outdegee)  in  terms  of  distrust  relations  of  Uj,  Jaccard  coefficients  of 
indegree  (or  outdegree)  of  w*  and  Uj  in  terms  of  distrust  relations  and  16  weighted 
triads  suggested  by  [38]. 

With  definitions  of  user  features,  pair  features  and  sign  features,  we  extract  45 
features  in  total  for  each  pair  (ui,Uj)  including  8  user  features  of  Ui,  8  user  features 
of  Uj,  7  pair  features,  and  22  signed  features. 
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4.4.3  A  Supervised  Framework  -  NeLP 


Through  label  construction  and  feature  extraction,  we  prepare  training  data  to 
learn  classifiers  for  the  distrust  prediction  problem.  However,  the  labels  of  the  training 
data  are  noisy,  and  especially  so  for  negative  samples.  Therefore,  it  is  necessary  for 
the  base  classifier  to  be  tolerant  to  training  data  noise.  In  this  paper,  we  choose  a 
soft-margin  version  of  support  vector  machines  as  our  basic  classifier  because  it  has 
been  proven  to  be  highly  noise-tolerant  [14]. 

Let  X  =  {x\,  X2, . . .  ,xn}  be  the  set  of  user  pairs  in  E0  U  En  and  x,:  be  the  feature 
vector  representation  of  the  pair  x%.  The  standard  soft-margin  support  vector  machine 
for  the  distrust  prediction  problem  is  as  follows: 


min 

w,6,e 

S.t 


^llwll!  +  c'  €i 

XiGPSUNS 

r/j(wTx,:  +  b)  >  1  —  Cj,  Xi  G  PS  U  NS 
€i  >  0  Xi  £  PS  U  NS 


(4.11) 


Eq.  (4.11)  introduces  the  term  et  for  the  soft-margin  slack  variable  of  xt,  which 
can  be  viewed  as  the  allowance  for  the  noise  in  this  training  sample.  The  parameter 
C  controls  the  degree  of  impact  of  this  term.  In  the  distrust  prediction  problem, 
the  noise-levels  of  positive  and  negative  samples  are  different  because  positive  sam¬ 
ples  PS  are  generally  more  robust  than  the  (indirectly  derived)  negative  samples. 
As  discussed  earlier,  the  reliability  of  negative  samples  is  explicitly  quantified  with 
their  weights.  These  intuitions  suggest  that  we  should  allow  more  errors  in  negative 
samples  especially  when  their  weights  suggest  unreliability.  This  yields  the  following 
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formulation: 


2  _  _ 

min  — 1| w|||  +  Cp  Y"  e*  +  Cn  V'  c^- 

w,b,e  Z  z z — ' 

Xi&PS  Xj&NS 

S.t.  Ui (wTXj  +  b)  >  1  -  €i,  Xi  G  PS 

jq(wTXj  +  b)  >  1  —  6j,  Xj  e  NS 

6i  >  0,  €j  >  0  (4.12) 

In  Eq.  (4.12),  we  use  two  parameters  Cp  and  Cn  to  weight  the  positive  and  negative 
errors  differently.  We  use  a  larger  value  for  Cp  compared  to  Cn  to  reflect  the  differential 
behavior  of  the  positive  and  negative  samples.  For  a  negative  sample  Xj,  we  introduce 
a  weight  Cj  to  further  control  its  error  based  on  its  quantified  reliability  weight.  For 
the  negative  sample  x3  corresponding  to  the  pair  (rq,  Uk),  we  set  c3  =  W ^  where  W* 
is  the  reliability  weight  for  ( Ui,Uk )■  This  additional  term  allows  differential  control  of 
the  noise  in  negative  samples  of  varying  reliability. 

Balance  theory  suggests  that  triads  in  signed  networks  are  likely  to  be  balanced; 
hence  we  want  to  maintain  or  increase  the  structural  balance  after  distrust  prediction. 
If  there  is  a  trust  relation  between  tq  and  Uj,  and  both  rq  and  u3  do  not  have  trust 
relations  with  another  user  the  types  of  ( tq,^ )  and  ( Uj,Uk )  in  the  distrust  graph 
Qn  are  likely  to  be  the  same.  In  other  words,  to  maintain  or  increase  the  structural 
balance,  it  is  likely  that  both  are  distrust  relations  where  ( rq ,  Uj,  u *,)  forms  a  balanced 
triad  or  both  are  missing  relations  where  there  is  no  triad  among  ( Ui,Uj,Uk )■  With 
this  intuition,  we  introduce  a  matrix  B  where  B he  =  1  if  there  is  a  trust  relation 
between  tq  and  Uj  where  we  assume  that  Xh  and  X£  denote  pairs  (ui,Uk)  and  (uj,Uk), 
respectively.  Otherwise,  we  assume  that  By  =  0.  Then,  we  force  Xh  and  xi  to  have 
the  same  types  of  links  if  By  =  1  by  introducing  a  balance-theory  regularization: 

min  -  Bfr£(wTXh  —  wTx^)^  =  wTX£XTw  (4-13) 

2  h,t 
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Here,  C  is  the  Laplacian  matrix  based  on  B.  The  number  of  pairs  in  En  U  E0  is  usually 
very  large,  which  leads  to  a  large  number  of  terms  in  the  balance  theory  regularization. 
The  observation  from  data  analysis  suggests  that  our  “foes”  are  usually  close  to  us 
in  the  trust  network.  Hence,  in  this  work,  we  only  consider  pairs  whose  shortest 
path  lengths  are  2,  and  pairs  in  NS  and  PS  in  the  balance  theory  regularization.  We 
assume  that  there  are  l+/a  samples  in  X  where  the  first  l  ones  are  from  PS  U  NS.  The 
significance  of  the  introduction  of  the  balance  theory  regularization  is  two-fold.  First, 
it  allows  us  to  model  balance  theory.  Second,  it  allows  us  to  include  more  samples 
during  the  learning  process  in  addition  to  NS  and  PS.  A  similar  function  is  achieved 
by  this  approach,  as  achieved  by  unlabeled  samples  in  semi-supervised  learning  [106]. 
With  these  components,  the  proposed  NeLP  framework  is  able  to  solve  the  following 
optimization  problem: 

min  ^|| w|||  T  Cp  V]  e*  +  Cn  V]  a,e-  +  ^wTX£XTw 

w.o.e  Z  L z '  Z 

XiEPS  XjENS 

s.t.  y.t(wTx?:  +  b)  >  1  —  6i,  XiEPS 
yJ(wTxJ  +  b)  >  1  —  €j,  Xj  G  NS 

6i  >  0,  6j  >  0  (4.14) 

We  solve  the  optimization  problem  in  Eq.  (4.14)  based  on  the  dual  form  [4]  and  more 
details  are  presented  in  Appendix  B. 

4.4.4  Evaluation 

In  this  subsection,  we  present  experiments  which  (a)  quantify  the  performance  of 
the  proposed  NeLP  framework  in  predicting  distrust  links,  and  (b)  evaluate  the  con¬ 
tribution  of  various  model  components  to  the  performance.  We  begin  by  introducing 
performance  evaluation  metrics,  which  are  useful  in  both  contexts. 
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True  class  =  -1 

True  class  =  1 

Predicted  class  =  -1 

true  pos.  (tp) 

false  pos.  (fp) 

Predicted  class  =  1 

false  neg.  (fn) 

true  neg.  (tn) 

Table  4.2:  Confusion  Matrix  of  a  Binary  Classifier. 

Experimental  Settings 

Distrust  prediction  can  be  viewed  as  a  highly  imbalanced  classification  problem.  In 
such  case,  straightforward  accuracy  measures  are  well  known  to  be  misleading  [96]. 
For  example,  the  trivial  classifier  that  labels  all  samples  as  missing  relation  can  have  a 
99.99%  accuracy  in  Epinions.  In  distrust  prediction,  we  aim  to  achieve  high  precision 
and  recall  over  distrust  relations,  defined  in  terms  of  the  confusion  matrix  of  a  classifier 
as  shown  in  Table  4.2:  precision  =  tp+fp  and  recall  =  •  Usually  precision  and 

recall  are  combined  into  their  harmonic  mean,  the  F-measure.  Therefore  we  will  adopt 
Fl-measure  as  one  metric  for  the  performance  evaluation.  As  suggested  in  [96],  in 
some  scenarios,  we  put  more  emphasis  on  precision  because  the  most  challenging  task 
is  to  seek  some  distrust  relations  with  high  probability,  even  at  the  price  of  increasing 
false  negatives.  Hence,  we  also  report  the  precision  performance. 

Performance  of  Distrust  Prediction 

For  the  evaluation  purpose,  we  define  the  following  baseline  methods  for  the  proposed 
framework  NeLP: 

•  Random.  This  predictor  randomly  guesses  the  labels  of  samples. 

•  sPatk  Observations  in  data  analysis  suggest  that  our  “foes”  are  always  close 
to  us  in  the  trust  network  and  sPath  assigns  distrust  relations  to  pairs  whose 
shortest  path  lengths  is  L; 
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Algorithms 

Epinions 

FI 

Precision 

random 

0.0005 

0.0002 

sPath 

0.0040 

0.0075 

negln 

0.2826 

0.2097 

neglnS 

0.2893 

0.2124 

NeLP-negln 

0.3206 

0.2812 

NeLP 

0.3242 

0.2861 

Table  4.3:  Performance  Comparison  of  Distrust  Prediction  in  Epinions. 

•  negln :  Given  the  strong  correlation  between  negative  interactions  and  distrust 
links,  negln  suggests  distrust  relations  to  these  pairs  with  negative  interactions; 

•  neglnS :  after  obtaining  distrust  candidates  via  negln,  neglnS  further  refines 
these  candidates  by  performing  a  removing  step  and  an  adding  step  as  shown 
in  Algorithm  2;  and 

•  NeLP-negln :  NeLP-negln  is  a  variant  of  the  proposed  NeLP  framework.  Instead 
of  using  distrust  links  suggested  by  neglnS  as  NeLP,  NeLP-negln  uses  distrust 
relations  found  by  negln. 

For  parameterized  methods,  we  report  the  best  performance  of  each  baseline 
method.  For  NeLP,  we  set  its  parameters  as  {Cp  =  1 ,  CJn  =  0.5,  Cf,  =  0.1}.  We 
empirically  find  that  g(x)  =  1  —  log^+J)  works  well  for  the  proposed  framework.  More 
details  about  parameter  sensitivity  of  NeLP  will  be  discussed  later.  The  comparison 
results  are  demonstrated  in  Table  5.1. 

We  make  the  following  observations: 
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•  sPath  obtains  much  better  performance  than  random  guessing,  which  further 
supports  the  hypothesis  that  our  “foes”  are  close  to  us  in  the  trust  network; 

•  negln  improves  the  performance  significantly.  These  results  suggest  the  exis¬ 
tence  of  correlation  between  negative  interactions  and  distrust  relations; 

•  by  removing  candidates  suggested  by  negln  that  do  not  satisfy  status  theory  and 
adding  candidates  to  make  open  triads  closure  to  satisfy  status  theory,  neglnS 
outperforms  negln.  For  example,  neglnS  gains  2.37%  relative  improvement  in 
terms  of  Fl-measure  in  Epinions.  These  results  indicate  that  status  theory  can 
help  us  remove  some  noisy  samples  and  add  some  useful  samples  for  training. 
These  observations  can  also  be  used  to  explain  the  reason  why  the  performance 
of  NeLP  based  on  distrust  relations  suggested  by  neglnS  is  better  than  that 
based  on  negln ;  and 

•  The  proposed  framework  always  obtains  the  best  performance. 

Component  Analysis  of  NeLP 

There  are  three  important  components  of  NeLP.  First,  NeLP  introduces  Cn  to  control 
errors  in  negative  samples.  Second,  NeLP  introduces  Cj  to  control  the  error  in  the 
sample  Xj,  which  is  related  to  the  number  of  negative  interactions  based  on  our  ob¬ 
servations  from  data  analysis.  Third,  NeLP  introduces  balance  theory  regularization 
to  model  balance  theory,  which  also  allows  us  to  include  more  samples  in  the  clas¬ 
sifier  learning  process.  Next  we  discuss  the  effects  of  these  components.  By  setting 
Cp  —  1  and  varying  different  values  of  Cn,  Cj  and  we  can  examine  the  impact 
of  these  components  on  the  performance  of  NeLP.  The  results  of  component  analysis 
are  shown  in  Tables  4.4. 
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Cn 

C3 

cb 

F  1-measure 

Precision 

0.5 

f(X)  =  1  108(1+*) 

0.1 

0.3242 

0.2861 

1 

f(X)  =  1  108(1+*) 

0.1 

0.3188 

0.2793 

0.5 

f(x)  =  1 

0.1 

0.3067 

0.2612 

0.5 

f(X)  =  1  log(l+*) 

0 

0.3084 

0.2686 

1 

f(x)  =  1 

0 

0.2992 

0.2342 

Table  4.4:  Component  analysis  for  Ne’ 


P  in  Epinions. 


The  first  row  in  the  table  represents  the  performance  of  NeLP  with  all  three 
components.  We  make  the  following  observations  about  different  variations  of  NeLP 
in  other  rows  of  the  table: 

•  in  the  second  row,  we  set  CJn  —  1,  which  gives  equal  weights  to  positive  and 
negative  samples.  This  approach  effectively  eliminates  the  differential  impor¬ 
tance  given  to  errors  from  negative  samples.  The  performance  degrades,  which 
suggests  that  the  errors  of  negative  and  positive  samples  should  be  treated 
differently; 

•  in  the  third  row,  we  set  Cj  =  1  instead  of  the  reliability  weight  related  to  the 
number  of  negative  interactions  to  eliminate  the  component  controlling  the  error 
in  the  negative  sample  xr  The  performance  reduces  a  lot.  For  example,  the 
precision  reduces  by  8.70%  in  Epinions.  These  results  support  the  importance 
of  the  number  of  negative  interactions  to  indicate  the  reliability  of  negative 
samples; 

•  in  the  fourth  row,  we  set  Cb  =  0  to  eliminate  the  contribution  from  the  balance 
theory  regularization.  The  performance  is  consistently  worse  than  that  with  the 
balance  theory  regularization.  This  illustrates  the  importance  of  the  balance 
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theory  regularization  in  the  proposed  NeLP  framework;  and 

•  in  the  fifth  row,  we  eliminate  all  these  three  components  and  the  performance 
further  degrades.  These  results  suggest  that  the  three  components  contain 
complementary  information. 

4.5  Conclusion 

In  this  chapter,  we  propose  an  unsupervised  framework  dTrust  and  a  supervised 
framework  NeLP  to  predict  distrust  by  leveraging  trust  and  content-centric  user  in¬ 
teractions.  We  make  a  number  of  findings  about  distrust  including  (1)  our  distrusted 
users  are  close  to  us  (usually  within  2  or  3  hops);  (2)  most  triads  in  trust /distrust 
networks  satisfy  balance  and  status  theories;  (3)  there  is  a  strong  correlation  between 
distrust  and  negative  interactions;  and  (4)  the  more  negative  interactions  two  users 
have,  the  more  likely  a  distrust  relation  exists  between  them.  Evaluations  of  dTrust 
and  NeLP  on  Epinions  suggest  that  (1)  balance  and  status  theories  play  important 
roles  in  distrust  prediction;  (2)  negative  interactions  are  strong  indicators  of  distrust; 
and  (3)  distrust  can  be  accurately  predicted  by  using  trust  and  content-centric  user 
interactions. 


52 


Chapter  5 


APPLYING  DISTRUST 


Trust  is  used  in  many  real- world  applications  such  as  node  classification  [65],  in¬ 
formation  propagation  [31],  recommendation  [82,  91],  information  filtering  [76]  and 
feature  selection  [84,  85,  88,  86].  Distrust  plays  a  different  role  in  many  ways  from 
trust.  Simply  extending  applications  of  trust  may  not  work  for  distrust  [10].  In 
addition,  it  would  be  ideal  to  put  research  findings  into  real-world  applications.  It 
is  also  an  ultimate  assessment  of  impact  for  distrust.  Hence,  we  propose  to  apply 
distrust  together  with  trust  in  our  efforts  to  further  understand  the  role  and  added 
value  of  distrust.  We  focus  on  two  applications  of  distrust  -  node  classification  and 
recommendation.  The  reason  is  two-fold.  First,  these  two  problems  are  quite  general 
and  many  real-world  applications  can  be  essentially  considered  as  one  of  these  two 
problems.  For  example,  sentiment  analysis  can  be  formulated  as  a  classification  prob¬ 
lem  [70,  26,  90].  Second,  these  two  applications  can  serve  as  examples  when  exploring 
other  data-intensive  applications. 

5.1  Node  Classification 

User  information  such  as  demographic  values,  interest,  beliefs  or  other  charac¬ 
teristics  plays  an  important  role  in  helping  social  media  sites  provide  better  services 
for  their  users  such  as  recommendations  and  content  filtering.  However,  most  so¬ 
cial  media  users  do  not  share  too  much  of  their  information  [103].  For  example, 
more  than  90%  of  users  in  Facebook  do  not  reveal  their  political  views  [1].  One  way 
of  bridging  this  knowledge  gap  is  to  infer  missing  user  information  by  leveraging  the 
pervasively  available  network  structures  in  social  media.  An  example  of  such  inference 
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is  that  of  node  classification  in  trust  networks.  The  node  classification  problem  has 
been  extensively  studied  in  the  literature  [19].  Existing  node  classification  algorithms 
can  be  mainly  grouped  into  local  classifier  based  methods  and  random  walk  based 
methods  [5].  The  vast  majority  of  these  algorithms  have  focused  on  trust  networks 
[58,  92,  44,  106,  104,  65],  while  little  work  exists  for  trust /distrust  networks  [71]. 

5.1.1  Problem  Statement 

Let  C  =  {ci,  c2, . . . ,  cm}  be  the  set  of  m  label  classes.  Assume  that  UL  = 
{«i,  u2,  ■  ■  ■ ,  un}  is  the  set  of  N  labeled  nodes  where  N  <  n  and  Uu  =  U\UL  is 
the  set  of  n  —  N  unlabeled  users.  We  use  Y  e  c]enote  the  label  indicator 

matrix  for  UL  where  Y ^  =  1  if  u,  is  labeled  as  c*,,  and  Y ^  =  0  otherwise.  With 
the  aforementioned  notations  and  definitions,  the  problem  of  node  classification  in  a 
trust /distrust  can  be  formally  stated  as  follows: 

Given  a  trust/ distrust  network  Q  with  trust  relations  T,  distrust  relations  D,  and 
labels  Y  for  some  nodes  UL ,  the  problem  of  node  classification  in  a  trust/ distrust 
network  aims  to  infer  labels  for  the  unlabeled  nodes  Uu . 

5.1.2  Transforming  Algorithms  from  Trust  to  Trust  and  Distrust  Networks 

Existing  node  classification  algorithms  in  trust  networks  can  be  mainly  divided 
into  local  classifier  based  methods  and  random  walk  based  methods  [5].  In  this 
subsection,  we  investigate  how  to  generalize  these  key  categories  of  representative 
algorithms  in  trust  networks  to  trust /distrust  networks. 

Local  Classifier  based  Methods 

Local  classifier  based  methods  use  local  neighborhood  information  to  learn  local  clas¬ 
sifiers  [58,  44,  49].  Iterative  classification  methods  (or  ICA)  [44]  are  representative 
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methods  in  this  family.  A  typical  ICA  algorithm  first  extracts  feature  vectors  for 
nodes.  Typically,  three  types  of  features  are  extracted  as  shown  in  Figure  5.1a: 

•  LFIP  =  {/i, /2, . . . , /m}  denotes  the  set  of  features  of  label  distributions  of 
incoming  relations  (or  indegree)  of  the  trust  network  T.  The  number  of  features 
in  LFip  is  equal  to  that  of  class  labels  and  the  value  of  fk  for  Uj  is  often 
calculated  as  the  frequency  of  in  the  incoming  relations  of  Uj  in  T; 

•  Similarly,  LFqp  is  the  set  of  features  of  label  distributions  of  outgoing  relations 
(or  outdegree)  of  T;  and 

•  SFp  is  the  set  of  features  extracted  from  the  topological  information  of  T  such 
as  indegree,  outdegree, and  the  number  of  triads. 

With  the  constructed  features,  it  builds  a  traditional  classifier  with  labeled  nodes 
UL .  The  resulting  classifier  is  used  to  infer  labels  of  nodes  in  Uu  in  an  iterative 
fashion,  where  the  most  confidently  predicted  labels  are  added  to  the  labeled  set.  In 
each  iteration  a  new  set  of  features  LFjp  and  LFop  is  extracted  with  the  augmented 
labels,  and  the  aforementioned  process  is  repeated  until  stable  performance  is  achieved 
or  a  maximum  number  of  iterations  have  been  executed. 

To  transform  the  aforementioned  ICA  algorithm  to  trust /distrust  networks,  we 
define  the  extracted  features  from  trust /distrust  networks  as  shown  in  Figure  5.1b: 

•  We  continue  to  use  the  feature  sets  LFIP  and  LFOP  because  a  trust /distrust 
network  Q  naturally  includes  a  social  network  with  only  trust  relations  T; 

•  Similar  to  LFjp  for  trust  relations,  we  define  LFjn  to  capture  label  distributions 
of  incoming  distrust  relations  (or  indegree  of  trust  relations); 

•  Similar  to  LF0p  for  trust  relations,  LFqn  is  defined  to  capture  label  distribu¬ 
tions  of  outgoing  distrust  relations  (or  outdegree  of  distrust  relations);  and 
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Feature 

Types 


Descriptions 


Ux  U2 


SFP 


LFIP 

Label  features  from  indegree  of  trust  links 

LF0P 

Label  features  from  outdegree  of  trust  links 

LFm 

Label  features  from  indegree  of  distrust  links 

T  F 

L-‘rON 

Label  features  from  outdegree  of  distrust  links 

SFP 

Structure  features  from  the  trust  network 

SFPN 

Structure  features  from  the  trust/distrust  network 

(a)  Trust  Networks  (b)  Trust/Distrust  Networks  (c)  Feature  Descriptions 

Figure  5.1:  Feature  Construction  for  Iterative  Classification  Methods  in  Trust  and 
Trust/Distrust  Networks. 


•  We  redefine  structural  features  SFPP/  for  trust /distrust  networks  according  to 
one  of  the  most  important  theories  for  trust/distrust  networks,  i.e. ,  balance 
theory  [25].  According  to  balance  theory,  £- length  circles  can  capture  important 
topological  properties  of  trust /distrust  [11]  and  we  extract  ^-length  circles  for 
SFpn.  For  example,  when  £  =  3,  SFPN  is  the  set  of  triad  features  suggested 
by  [38], 

A  summary  of  the  extracted  features  for  ICA  in  trust  and  trust /distrust  networks 
is  illustrated  in  Figure  5.1.  To  transform  ICA,  we  define  two  feature  sets  LFim  and 
LFon  based  on  distrust  relations,  and  redefine  structural  features  SFPPr  according 
to  balance  theory. 


Random  Walk  based  Methods 

Random  walk  based  methods  propagate  labels  from  labeled  nodes  to  unlabeled  nodes 
by  performing  random  walks  on  the  trust  network  [6,  106,  104,  105].  Graph  reg¬ 
ularization  based  methods  are  representative  methods  in  this  family  [5].  A  typical 
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formulation  for  graph  regularization  based  methods  is  as  follows  [105]: 


min  Tij(— 7= - +  V  IIla-  —  Yfc||l  (5.1) 

uuiijEU  Jd?p  uk  EUl 

where  L,  is  the  predicted  label  indicator  vector  for  tq  in  the  network  Q,  and  d{p  and 
d°p  are  the  indegree  and  outdegree  of  trust  relations  for  rq  and  Uj,  respectively.  In 
Eq.(  5.1),  the  first  term  ensures  greater  local  consistency  of  labels  of  users  connected 
with  trust  relations,  and  the  second  term  enforces  global  consistency  of  the  inferred 
labels  with  the  provided  training  labels.  The  balance  between  the  two  criteria  is 
controlled  by  a  parameter  /i. 

Next,  we  discuss  two  possible  ways  of  generalizing  the  aforementioned  algorithm 
to  the  trust /distrust  scenario: 


•  The  underlying  assumption  of  the  aforementioned  algorithm  is  that  two  users 
with  a  trust  relation  are  likely  to  share  similar  labels,  which  can  be  explained 
by  two  popular  social  theories,  i.e.,  homophily  and  social  influence.  Intuitively, 
a  distrust  relation  may  denote  dissimilarity  or  distance.  Therefore,  one  possible 
way  to  capture  distrust  relations  is  to  force  labels  of  two  users  with  a  distrust 
relation  dissimilar  by  introducing  a  term  in  Eq.  (5.1)  as: 


mm 


E  T«< 


+>i  e 


^  flip  /5ET'  1  ^ 

,uj&u  Vai  ydj  uk&uL 

E  D«(- 


|Lfc  —  Yfc||l 


(5.2) 


U~^eu  Jd°* 

where  d{N  and  dfN  are  the  indegree  and  outdegree  of  distrust  relations  for  ut 
and  Uj,  respectively,  and  the  third  term  captures  the  contribution  from  distrust 
relations,  which  is  controlled  by  rj. 


•  Previous  work  demonstrated  that  more  than  90%  of  triads  in  trust /distrust 


57 


networks  satisfy  status  theory  [39].  To  apply  Eq.  (5.1),  we  can  convert  distrust 
relations  to  trust  links  via  status  theory. 

Let  Sij  —  1  if  Ui  has  a  trust  relation  to  Uj  and  .SV,  =  —1  for  a  distrust  relation. 
In  [38],  an  approach  based  on  status  theory  is  proposed  to  determine  the  sign 
from  Ui  to  Uk  in  an  open  triad  with  given  signs  between  ut  and  Uj,  and  between 
Uj  and  Uk-  The  algorithm  first  flips  the  directions  of  relations  between  ut  and 
Uj,  and  between  Uj  and  Uk,  if  possible,  so  that  they  point  from  tq  to  Uj  and  Uj 
to  Uk-  Then  it  also  flips  the  signs  of  the  relations  correspondingly  if  we  flip  their 
directions.  Finally  the  sign  from  iq  to  Uk  is  Stk  =  Sl3  +  Sjk- 

According  to  the  aforementioned  algorithm,  we  can  convert  distrust  relations 
into  trust  relations.  Three  examples  are  shown  in  Figure  5.2  where  Uk  — >  Ui 
in  A,  Ui  — >  Uk  in  B  and  Uk  — >  Ui  in  C  are  converted  trust  relations.  We  have 
=  —1  and  Sjk  =  —1  for  the  open  triad  A  in  Figure  5.2  and  next  we  will  use 
A  as  an  example  to  illustrate  how  to  covert  distrust  relations  to  trust  relations. 
According  to  the  algorithm,  we  can  calculate  that  Ski  =  1;  hence,  we  add  a  trust 
relation  from  Uk  to  tq.  A  similar  process  can  be  performed  for  B  and  C.  After 
that,  we  remove  all  distrust  relations  and  the  original  trust /distrust  network  is 
converted  into  a  trust  network.  As  a  result,  Eq.  (5.1)  will  be  applicable  to  this 
new  trust  network. 

5.1.3  The  Proposed  Framework  -  NCSSN 

In  this  section,  we  will  first  introduce  a  node  classification  algorithm  with  only 
trust  relations,  and  then  give  details  about  how  to  capture  distrust  relations  and  the 
proposed  NCSSN  framework. 
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Figure  5.2:  Examples  of  Converting  Distrust  Relations  into  Trust  Relations  accord¬ 
ing  to  Status  Theory. 


A  Node  Classification  Algorithm  with  Only  Trust  Relations 

To  perform  the  classification,  we  associate  latent  factors  with  nodes.  Let  U*  G 
be  the  latent  factor  of  rq.  Then,  a  trust  relation  from  ut  to  Uj  can  be  modeled  as 
the  interaction  between  their  latent  factors  as  Ty  =  U,H'Uj  where  FT  G  MAxA 
captures  the  interaction  for  trust  relations  [77].  We  use  a  linear  classifier  W  G  MArxm 
to  capture  the  label  information  from  labeled  nodes  based  on  their  latent  factors  as 
Y i  =  UjW.  The  proposed  node  classification  algorithm  with  only  trust  relations 
solves  the  following  optimization  problem: 


mm 

Hp>0,U>0,W 


X]  ||Tjj  —  U;tTUj III  +  a  Y,  ||U, W  —  Y 


2 
*112 


(5.3) 


Ui,Uj&U  Ui&JL 

where  a  controls  the  contribution  from  labeled  nodes. 

The  Eq.  (5.3)  can  be  rewritten  in  matrix  form  as  follows: 


min  I  IT  -  UH'Ut||2  +  d|C(UW  -  Y)||  p  (5.4) 

HP>0,U>0,W  V 

where  U  =  [Up  U2; . . . ;  Un]  G  M+xA:  and  C  G  MnXTl  is  a  diagonal  matrix,  where 
C a  —  1  if  Ui  G  UL  and  C a  =  0,  otherwise. 
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Capturing  Distrust  Relations 


Independent  information  from  distrust  relations  is  crucial  to  account  for  distinct  topo¬ 
logical  properties  of  distrust  relations  in  modeling  [69];  while  as  suggested  in  [38], 
trust  and  distrust  relations  should  also  be  viewed  as  tightly  related  features  in  a  sin¬ 
gle  structure.  Therefore,  we  capture  two  types  of  information  when  modeling  distrust 
relations.  One  is  information  from  only  distrust  relations  which  we  refer  to  as  inde¬ 
pendent  information  from  distrust  relations.  The  other  is  information  derived  from 
the  interactions  between  trust  and  distrust  relations  which  we  refer  to  as  dependent 
information  from  distrust  relations. 

To  capture  independent  information,  which  is  similar  to  modeling  trust  relations, 
a  distrust  relation  from  ut  to  Uj  can  be  modeled  as  the  interaction  between  their 
latent  factors  as  follows: 

D ij  =  l  II  l  (5.5) 

Here,  the  notation  H(/  e  MAxA  is  introduced  to  capture  the  interaction  for  distrust 
relations. 

The  notion  of  structural  balance  was  extended  by  [15].  According  to  this  principle, 
a  structure  in  a  trust / distrust  network  should  ensure  that  users  are  able  to  have  their 
“friends  “  closer  than  their  “foes”  i.e.,  users  should  sit  closer  to  their  “friends”  (or 
users  with  trust  relations)  than  their  “foes”  (or  users  with  distrust  relations).  For 
(i,j,  k)  where  Ui  has  a  distrust  relation  to  Uj  and  a  distrust  relation  to  Uk,  we  force  Ui 
closer  to  her  “friend”  Uj  than  her  “foe”  Uk  in  terms  of  their  latent  factors  to  capture 
dependent  information  according  to  the  aforementioned  theory.  To  achieve  this  goal, 
we  consider  the  following  two  cases: 

•  Case  1:  If  a  user  u,  sits  closer  to  her  “friend”  Uj  than  her  “foes”  it*,,  i.e., 
l|Uz-Uy||l- ||U,-Ufc|||  <  0,  we  should  not  penalize  this  case  since  we  expect 
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it. 

•  Case  2:  If  a  user  vn  sits  closer  to  her  “enemy”  Uk  than  her  “friend”  Uj,  i.e., 
||Ui  —  Uj-lll  —  ||Ui  —  Ufclll  >  0,  we  should  add  a  penalty  to  pull  U{  closer  to  Uj 
than  Uk- 

Based  on  the  aforementioned  analysis,  we  propose  the  following  formulation  to 
capture  dependent  information  from  distrust  relations  as 

min  max(0,  ||Uj  —  Uj||2  —  ||Uj  —  Ufclll)  (5.6) 

(i,j,k)es 

S  is  the  set  of  (■ i,j ,  k)  where  U{  has  a  trust  relation  to  Uj  while  has  a  distrust  relation 
to  Uk,  which  is  formally  defined  as 

S  =  {(i,j,k)  |A^  =  1AA”  =1}  (5.7) 

Next,  we  give  details  on  the  inner  workings  of  Eq.  (5.6): 

•  When  a  user  U{  sits  closer  to  her  “friend”  Uj  than  her  “foe”  Uk,  i.e.,  Case  1,  the 
minimizing  term  in  Eq.  (5.6)  is  0  since  || XJ*  —  U ^ 1 1 1  —  || U*  —  XJfc|||  <  0.  We  do 
not  add  any  penalty  for  Case  1 ; 

•  When  a  user  ut  sits  closer  to  her  “foe”  Uk  than  her  “friend”  Uj,  i.e.,  Case  2,  the 
minimizing  term  in  Eq.  (5.6)  is  || U*  —  XJ j- 1 1 1  —  ||Ui  —  U fc || |  since  || U*  —  Uj|||  — 
||Ui  —  Ufclll  >  0.  Eq.  (5.6)  will  pull  tq  closer  to  Uj  than  Uk  for  Case  2. 

Eq.  (5.6)  can  be  rewritten  in  the  matrix  form  as  follows: 

Y  max( OjUi-U^-jlUi  ^Ufc||l)  =  ^  fijkTr(MijkVVT)  (5.8) 
(i,j,k)es  (i,j,k)es 


61 


where  M^k  has  Mg*  =  M)f  =  Mg*  =  -1  and  Mg*  =  M/f  =  Mg*  =  1  with  other 
entries  equal  to  zero.  The  term  is  defined  as  follows: 

{1  if  Tr(MijfcUUT)  >  0 

(5.9) 

0  otherwise 

With  models  of  independent  and  dependent  information  from  distrust  relations, 
we  propose  the  following  formulation  to  leverage  distrust  relations: 

min  1 1  A”  —  UH"UT||^  +  fijkTr(MijkVVT)  (5.10) 

(i,j,k)eS 

The  Proposed  Formulation  for  NCSSN 

Combining  Eqs.  (5.4)  and  (5.10),  the  proposed  node  classification  framework  in 
trust /distrust  networks  NCSSN  is  to  solve  the  following  optimization  problem: 

min  NT  -  UHfUT|||  +  d|C(UW  -  Y)||! 

H‘>0,Hd>0,U>0,W 

+  /3(||D-UHc,Ut||^+  ^  fljkTr{  MijW)) 

(i,j,k)es 

+  A(||H*|||  +  ||Hd||!  +  \\XJ\\2f  +  \\W\\l)  (5.11) 

where  the  first  term  to  the  fourth  term  capture  trust  information,  label  information 
from  labeled  nodes,  independent  information  from  distrust  relations  and  dependent 
information  from  distrust  relations,  respectively.  The  term  A(||H*||^+||Hci|||1+||U|||1+ 
||W|||i)  is  introduced  to  avoid  overfitting.  The  relationships  among  U,  FT,  Hd  and 
W  make  the  problem  of  hireling  optimal  solutions  for  all  parameters  in  Eq.  (5.11) 
difficult  to  determine  simultaneously.  In  this  work,  we  adopt  an  alternate  optimization 
scheme  [16]  for  Eq.  (5.11)  where  we  optimize  one  component  while  fixing  others.  More 
details  are  shown  in  Appendix  C.  After  we  learn  W  and  U,  the  label  of  a  node  in 
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the  trust /distrust  network  Ui  G  Uu  can  be  predicted  as  follows: 


c*  —  argmax([UjW]j)  (5-12) 

c,-ec 

5.1.4  Evaluation 

Dataset  and  Experimental  Settings 

Epinions  can  write  reviews  for  products  from  various  categories.  We  chose  these 
categories  as  the  class  labels  of  users.  For  a  user  who  writes  reviews  for  products 
from  multiple  categories,  we  chose  the  one  with  most  products  she  writes  reviews  to 
as  her  label.  We  perform  additional  preprocessing  on  the  original  Epinions  dataset  by 
filtering  users  without  any  labels,  and  class  labels  with  a  limited  number  of  users.  The 
new  dataset  includes  23,280  users,  291,422  trust  relations,  40,792  distrust  relations 
and  20  class  labels. 

In  each  case,  we  randomly  choose  x%  of  nodes  labeled  and  the  remaining  l  —  x%  as 
unlabeled  nodes  for  testing.  For  each  x,  we  repeat  the  experiments  10  times  and  report 
the  average  performance.  Since  it  is  very  common  for  labels  to  be  sparsely  specified, 
we  chose  relatively  small  values  of  x,  which  were  {5, 10, 15,  20}.  One  commonly  used 
measure,  referred  to  as  Micro-FI,  is  adopted  to  assess  the  classification  performance. 

Node  Classification  Performance 

In  this  subsection,  we  evaluate  the  classification  performance  with  trust /distrust  net¬ 
works  in  terms  of  (a)  the  performance  of  algorithms  transformed  from  trust  networks, 
and  (b)  the  performance  of  the  proposed  NCSSN  framework  with  respect  to  these 
transformed  methods. 

The  comparison  results  are  demonstrated  in  Table  5.1.  The  algorithms  in  the 
table  are  defined  as  follows: 
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•  ICA :  This  algorithm  is  a  traditional  iterative  classification  method  [44]  with 
features  LFIP,  LF0p  and  SFP.  We  apply  ICA  to  trust /distrust  networks  by 
ignoring  all  distrust  relations. 

•  sICAS :  This  algorithm  is  a  transformed  version  of  ICA  for  trust /distrust  net¬ 
works.  For  SFpn,  we  chose  circles  of  length  3  as  features. 

•  sICAlP  This  algorithm  is  also  a  transformed  version  of  ICA.  Different  from 
sICAS,  sICA4  uses  circles  of  length  4  as  features. 

•  GReg:  This  algorithm  is  a  traditional  graph  regularization  method  [105]  for  trust 
networks.  We  ignore  all  distrust  relations  when  we  apply  GReg  to  trust /distrust 
networks. 

•  disGReg:  This  algorithm  is  a  variant  of  GReg,  which  considers  a  distrust  relation 
as  dissimilarity. 

•  sGReg:  This  algorithm  is  also  a  variant  of  GReg.  To  apply  GReg  from  trust 
to  trust /distrust  networks,  sGReg  converts  distrust  relations  to  trust  relations 
according  to  status  theory. 

•  NCSSN:  This  is  the  proposed  algorithm  which  models  both  independent  and 
dependent  information  to  capture  distrust  relations. 

•  Random,  this  algorithm  chooses  class  labels  randomly  for  unlabeled  nodes. 

For  methods  with  parameters,  we  used  cross-validation  to  determine  their  values. 
However,  disGReg  achieves  the  best  performance  when  rj  —  0.  This  means  that  the 
distrust  relations  in  disGReg  reduce  its  performance.  To  show  the  impact  of  distrust 
relations  on  disGReg,  we  report  the  performance  when  rj  —  1  in  Table  5.1.  Note  that 
for  ICA,  sICAS  and  sICA4,  we  try  various  types  of  traditional  classifiers  including 
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Algorithms 

Epinions 

5% 

10% 

15% 

20% 

ICA 

10.57 

11.00 

11.45 

11.99 

sICAS 

11.09 

11.61 

12.03 

12.52 

sICA4 

11.75 

11.98 

12.36 

13.03 

GReg 

9.71 

11.02 

11.51 

12.10 

disGReg 

9.39 

10.70 

11.21 

11.80 

sGReg 

10.34 

11.68 

12.17 

12.56 

NCSSN 

12.08 

12.63 

13.22 

13.87 

Random 

4.98 

5.00 

5.00 

5.02 

Table  5.1:  Performance  Comparison  of 


Negative  Link  Prediction  in  Epinions. 


Naive  Bayes,  SVM,  logistic  regression,  linear  regression,  and  random  forests,  and  we 
report  the  best  performance. 

We  make  some  key  observations  from  Table  5.1: 

•  In  general,  with  the  increase  in  the  number  of  labeled  nodes,  the  classification 
performance  consistently  increases  for  all  methods  in  Table  5.1. 

•  The  relative  performance  improvement  of  sICAS  and  sICA4  compared  to  ICA 
is  shown  in  Table  5.2.  Both  sICAS  and  sICA4  outperform  ICA.  These  results 
support  the  contention  that  distrust  relations  are  helpful  in  the  node  classifica¬ 
tion  problem.  sICAJ^  often  outperforms  sICAS.  Since  trust /distrust  networks 
are  usually  very  sparse,  some  users  may  not  have  any  length  3  circle  features 
but  have  4- length  circle  features;  hence  length  4  circle  features  are  more  robust. 

•  The  relative  performance  improvement  of  disGReg  and  sGReg  compared  to  GReg 
is  shown  in  Table  5.3.  disGReg  performs  worse  than  GReg ,  which  suggests 
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sIC A3 

sICA4 

5% 

+4.92% 

+11.16% 

10% 

+5.55% 

+8.91% 

15% 

+5.07% 

+7.95% 

20% 

+4.42% 

+8.67% 

Table  5.2:  Relative  Performance  Improvement  of  sICAS  and  slCAJ{  Compared  to 
ICA. 


disGReg 

sGReg 

5% 

-3.30% 

+6.49% 

10% 

-2.90% 

+5.73% 

15% 

-2.61% 

+5.99% 

20% 

-2.48% 

+3.80% 

Table  5.3:  Relative  Performance  Improvement  of  disGReg  and  sGReg  Compared  to 
GReg. 


that  a  distrust  relation  may  not  denote  dissimilarity  and  transforming  GReg  to 
trust/distrust  by  considering  a  distrust  relation  as  dissimilarity  may  not  work. 
These  observations  are  consistent  with  our  previous  observations  in  Epinions 
that  distrust  in  Epinions  may  not  be  dissimilarity  measurements  [83].  sGReg 
obtains  better  performance  than  GReg.  Transforming  GReg  to  trust /distrust 
networks  by  converting  distrust  relations  to  trust  relations  according  to  status 
theory  can  improve  classification  performance. 

•  The  proposed  framework  NCSSN  always  obtains  the  best  performance.  NCSSN 
models  independent  and  dependent  information  to  capture  distrust  links  and 
the  contributions  of  these  two  components  to  the  performance  improvement 
from  NCSSN  will  be  discussed  in  the  following  subsection. 
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In  summary,  the  aforementioned  analysis  provides  the  insights  that  (a)  trans¬ 
formed  algorithms  from  trust  to  trust/distrust  networks  properly  can  improve  the 
classification  performance;  (b)  a  distrust  relation  may  not  denote  dissimilarity;  and 
(c)  the  proposed  NCSSN  framework  obtains  significant  performance  improvement 
compared  to  other  methods. 

Component  Analysis  for  NCSSN 

Based  on  the  performance  comparison  in  the  previous  subsection,  we  observe  that 
the  proposed  NCSSN  framework  improves  the  classification  performance  significantly. 
To  capture  distrust  relations,  NCSSN  provides  two  components  ||An  —  UHnUT||p  + 
^(i  j  k)&S  fijkTr( MyfcUUT)  to  model  independent  and  dependent  information,  re¬ 
spectively.  To  study  the  impact  of  distrust  relations  on  the  proposed  framework,  we 
systematically  eliminate  the  effects  of  these  two  components  by  defining  the  following 
variants  of  NCSSN. 

•  NCSSN\II  -  We  eliminate  the  effect  of  the  independent  component  on  NCSSN. 
In  particular,  we  remove  the  term  ||An  —  UHnUT||^  from  the  optimization 
problem  in  Eq.  (5.11). 

•  NCSSN\DI  -  We  eliminate  the  effect  of  the  dependent  component  on  NCSSN 
by  removing  the  term 

j  k)es  fijkTr(Mlik\J\JT)  from  the  optimization  problem  in  Eq.  (5.11). 

•  NCSSN\DII  -  We  eliminate  the  effects  of  both  independent  and  dependent 
components  on  NCSSN  by  setting  f3  —  0  in  Eq.  (5.11). 

The  parameters  in  all  variants  are  determined  with  cross-validation  and  the  per¬ 
formance  comparison  of  NCSSN  and  its  variants  are  demonstrated  in  Figure  5.3. 
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Figure  5.3:  The  Impact  of  Distrust  Relations  on  The  Proposed  Framework. 


When  we  eliminate  the  effect  of  the  independent  component,  the  performance 
of  NCSSN\II  degrades  in  comparison  with  NCSSN.  For  example,  the  performance 
reduces  6.95%  with  5%  of  labeled  users  in  Epinions.  These  results  suggest  that 
the  independent  component  in  NCSSN  is  important.  We  make  similar  observations 
for  NCSSN\DI  when  we  eliminate  the  effect  of  the  component  modeling  dependent 
information.  When  we  eliminate  the  effects  of  both  components,  the  performance  of 
NCSSN\DII  further  reduces  compared  to  NCSSN\II  and  NCSSN\DI.  These  results 
suggest  that  these  components  are  complementary  to  each  other. 

In  summary,  via  the  component  analysis  of  NCSSN,  we  conclude  that  (a)  both 
components  can  contribute  to  the  performance  improvement  of  NCSSN,  (b)  it  is 
necessary  to  model  both  because  they  contain  complementary  information. 


5.2  Recommendation 


In  the  physical  world,  we  always  seek  recommendations  from  our  trusted  friends, 
which  suggests  that  trust  information  may  be  useful  to  improve  recommendation 
performance.  Many  recommender  systems  are  proposed  to  incorporate  ones’  trust 
networks  for  recommendation  and  gain  performance  improvement  [52,  46,  27,  47,  28, 
82,  91].  Scholars  have  noted  that  distrust  information  may  be  more  noticeable  and 
credible  than  trust  information  with  a  similar  magnitude  [12],  In  this  section,  we 


investigate  how  to  exploit  trust/distrust  networks  for  recommendation  [72], 

5.2.1  Problem  Statement 

In  a  typical  recommender  system,  there  is  a  user-item  rating  matrix  R  G  Wixl 
where  /  is  the  number  of  the  set  of  items  V  =  {v\,V2, . . .  ,vj},  Ry-  the  rating  score 
if  Ui  rates  Vj,  and  0  otherwise.  Let  O  =  {(«*,  Vj)|R.jj  /  0}  be  the  set  of  observed 
ratings  and  AA  =  {(rq,  v/)  [R^  /  0}  be  the  set  of  missing  ratings.  The  problem  of 
recommendation  with  trust /distrust  networks  can  be  formally  stated  as  follows: 

Given  observed  ratings  O  and  a  trust/ distrust  network  Q  with  trust  relations  T, 
and  distrust  relations  D,  the  problem  of  recommendation  with  a  trust/ distrust  network 
aims  to  infer  missing  values  AA  in  R. 

5.2.2  A  Recommendation  Framework  with  Trust/Distrust  Networks  -  RecSSN 

Two  types  of  information  from  trust  networks  can  be  exploited  for  recommenda¬ 
tion,  which  correspond  to  local  information  and  global  information  [87].  Local  in¬ 
formation  reveals  the  correlations  among  the  user  and  his/her  trusted  friends,  while 
global  information  reveals  the  reputation  of  the  user  in  the  whole  network.  Users  in 
the  physical  world  are  likely  to  ask  for  suggestions  from  their  local  friends  while  they 
also  tend  to  seek  suggestions  from  users  with  high  global  reputation.  This  suggests 
that  both  local  and  global  information  can  be  exploited  in  trust  networks  to  improve 
the  performance  of  recommender  systems  [81].  In  the  following  subsections,  we  will 
first  provide  details  about  the  methods  for  capturing  local  and  global  information  in 
trust /distrust  networks,  and  then  introduce  the  proposed  RecSSN  framework. 

Matrix  factorization  is  chosen  as  our  basic  model  because  it  is  one  of  the  most 
popular  techniques  for  building  recommender  systems  [35,  34],  Assume  that  U*  G  MA 
is  the  K- -dimensional  preference  latent  factor  of  and  Vj  G  MA  is  the  K- -dimensional 
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characteristic  latent  factor  of  item  j.  Typically,  scores  from  ut  to  Vj  in  R?J  are  modeled 
by  the  interactions  between  their  latent  factors.  This  interaction  is  defined  in  terms 
of  the  product  of  the  latent  vectors: 

Ry  =  UjVj  (5.13) 

Matrix  factorization-based  recommcnder  systems  solve  the  following  optimization 
problem: 

n  m 

min  ^^Wy||Ry-U^||!  +  «(||U||^+[|V||P  (5.14) 

i=l  j= 1 

where  U  =  {U1}  U2, . . . ,  Un}  and  V  =  {Vi,  V2, . . . ,  Vm}.  Wy  controls  the  contri¬ 
bution  from  Rjj,  and  the  term  ||U||p  +  ||V|||.  is  added  to  avoid  overfitting. 

Capturing  Local  Information  from  Trust/Distrust  Networks 

The  local  information  in  trust /distrust  networks  is  about  the  preference  relations 
between  users,  and  their  “friends”  (or  users  with  positive  links)  and  “foes”  (or  users 
with  negative  links).  Next,  we  introduce  our  approach  to  capture  local  information 
from  trust/distrust  networks  based  on  the  findings  of  the  previous  section. 

Let  Vi  and  Mi  be  ?i/s  friend  circle,  including  users  who  have  trust  relations  with 
■Uj,  and  foe  circle,  including  users  who  have  distrust  relations  with  Ut,  respectively. 
Based  on  Vt  and  Mi,  we  can  divide  users  into  three  groups  as  below: 

•  OV  includes  users  who  have  only  trust  links  as  -  OV  =  {u.i\Vi  ^  0  fl  Mi  =  0}; 

•  OM  includes  users  who  have  only  distrust  links  as  -  OM  =  {iii\Vi  =  0HA/)  ^  0}; 

•  VM  contains  users  who  have  both  trust  and  distrust  links  as  -  VM  =  {ui\Vi  ^ 

0  u  Mi  ^  0}. 
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We  define  Uf  and  £/"  as  the  average  user  preferences  of  m/s  friend  circle  and  foe 
circle,  respectively,  as  follows: 


Up  = 


Thuj&Vi 


un  = 


Yluj&jVi  ^pU? 

Ylru.j&K 


(5.15) 


where  is  the  connection  strength  between  m*  and  Uj.  Next,  we  will  discuss  how  to 
capture  local  information  for  these  groups  separately: 


•  For  a  user  m*  with  only  friend  circle  (or  m*  G  OV),  our  previous  finding  suggests 
that  Mj’s  preference  is  likely  to  be  similar  with  her  friend  circle.  Hence,  we  force 
Mi’s  preference  close  to  V,  by  minimizing  the  following  term: 

min  ||Ui  —  tJf  Hi-  (5.16) 


•  For  a  user  Mi  with  only  foe  circle  (or  Mi  G  OAT),  this  user  is  likely  to  be  untrust¬ 
worthy  and  we  should  not  consider  this  user  for  the  purpose  of  recommenda¬ 
tion  [95].  Therefore,  we  ignore  local  information  from  these  users  with  only  foe 
circles,  which  are  only  a  small  portion  of  the  users  in  real-world  trust /distrust 
networks.  For  example,  in  the  studied  dataset,  there  are  less  than  5%  of  users 
with  only  foe  circles. 

•  For  a  user  Ui  with  both  friend  and  foe  circles,  our  previous  finding  suggests  that 
the  preference  of  m,  is  likely  to  be  closer  to  that  of  his/her  friend  circle  than  that 
of  his/her  foe  circle.  In  other  words,  (1)  if  a  user  ul  sits  closer  to  his/her  friend 
circle  V%  than  her  foe  circle  A//  i.e. ,  ||U;  —  Uf  |||  —  ||Uj  —  U"|||  <  0,  we  should 
not  penalize  this  case;  while  (2)  if  a  user  u,  sits  closer  to  his/her  foe  circle  A/) 
than  her  friend  circle  V, ,  i.e.,  ||U,  —  Uf  |||  —  ||Uj  —  U”|||  >  0,  we  should  add 
a  penalty  to  pull  'ut  closer  to  V,  than  A/).  Therefore,  we  propose  the  following 
minimization  term  to  force  m/s  preference  closer  to  Vt  than  A/)  as: 

min  max(0,  ||Ut  -  U?|||  -  ||U?:  -  U/||^)  (5.17) 
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Next,  we  give  details  on  the  inner  workings  of  Eq.  (5.17).  (1)  When  tq  sits 
closer  to  his/her  friend  circle  Vi  than  his/her  foe  circle  A/),  the  minimizing  term 
in  Eq.  (5.17)  is  0  because  || U*  —  Uf  || |  —  ||Uj  —  U”|||  <  0  and  we  do  not  add 
any  penalty;  and  (2)  when  tq  sits  closer  to  her  foe  circle  A/)  than  her  friend 
circle  Vi,  the  minimizing  term  in  Eq.  (5.17)  is  ||Uj  —  Uf  |||  —  ||U*  —  U”  because 
||Ut  —  Uf  |||  —  || Ui  —  U"  >  0  and  Eq.  (5.17)  will  pull  iq  back  to  Vt  from  A/). 


We  can  develop  a  unified  term  to  capture  local  information  from  these  three  groups 
in  trust /distrust  networks  with  the  following  observations  -  (1)  if  we  define  U”  =  U* 
for  Ui  in  OV,  the  term  for  OV  is  equivalent  to  max(0,  ||U/  —  Uf|||  —  ||U;  — U”|||);  and 
(2)  if  we  define  U"  =  U,:  for  Ui  in  OAf,  the  term  max(0,  ||U;  —  Uf|||  —  ||U*  —  U”|||) 
is  0  for  OAf,  which  indicates  that  we  ignore  the  impact  of  users  from  OAf .  Therefore 
by  redefining  Uf  and  Uf  as, 


U.t 

Ut 


for  Ui  G  OV  U  VAS ’ , 
for  Ui  G  OAf. 
for  Ui  G  OAf  U  VAf, 
for  Ui  G  OV, 


(5.18) 


we  can  find  a  unified  term  to  capture  local  information  from  trust /distrust  networks 
as: 


min  max(0,  ||Ui  -  Uf  ||1  -  ||Ui:  -  U! 


(5.19) 


i=  1 


Capturing  Global  Information  from  Signed  Social  Networks 


The  global  information  of  a  trust /distrust  network  reveals  the  reputation  of  a  user 
in  the  whole  network  [52],  User  reputation  is  a  sort  of  status  that  gives  additional 
powers  and  capabilities  in  recommender  systems  [81].  There  are  many  algorithms  to 
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calculate  the  reputations  of  nodes  in  trust  networks  [61,  32],  However,  a  small  num¬ 
ber  of  distrust  links  can  significantly  affect  the  status  of  the  nodes,  which  suggests 
that  we  should  consider  distrust  links.  Therefore,  we  choose  a  variant  of  Pagerank, 
Exponential  Ranking  [93],  taking  into  account  distrust  links  to  calculate  user  reputa¬ 
tions.  In  detail,  we  first  perform  Exponential  Ranking  to  rank  users  by  exploiting  the 
global  information  of  trust /distrust  networks.  We  assume  that  G  {1,2, .. .  ,N}  is 
the  reputation  ranking  of  Ui  where  rl  =  1  denotes  that  u-L  has  the  highest  reputation 
in  the  trust  network.  Then  we  define  user  reputation  score  w *  as  a  function  /  of 
user  reputation  ranking  ry  w,  =  /(r*)  where  the  function  /  limits  the  value  of  the 
reputation  score  Wj  within  [0, 1]  and  is  a  decreasing  function  of  rtl  i.e.,  top-ranked 
users  have  high  reputation  scores. 

In  the  physical  world,  user  reputation  plays  an  important  role  in  recommenda¬ 
tion.  Many  companies  employ  people  with  high  reputations  to  enhance  consumers’ 
awareness  and  understanding  of  their  products.  Seno  and  Lukas  found  that  sugges¬ 
tions  from  people  with  high  reputations  positively  affect  a  consumer’s  adoption  of 
a  brand  [66].  While  in  the  online  world,  Massa  found  that  recommendations  from 
users  with  high  reputations  are  more  likely  to  be  trustworthy  [52] .  To  capture  global 
information  from  trust /distrust,  we  can  use  user  reputation  scores  to  weight  the  im¬ 
portance  of  their  recommendations.  Originally  the  importance  of  R,:j  in  Eq.  (5.14) 
is  controlled  by  W \j.  With  trust /distrust  networks,  we  should  also  consider  the  rep¬ 
utation  of  Ui ;  hence  we  define  the  new  weight  for  R/;  as  Wy-  =  r/W^jW,)  where  g 
is  a  function  to  combine  two  weights.  With  these  new  weights,  the  formulation  to 
capture  global  information  from  trust /distrust  networks  is  computed  as  follows: 

N  m 

millEE»(w«-w')llR«  -  u.Tv>H’  +  “(IIUIIf  +  IIVIIP  (5,20) 

*— 1  j= 1 

where  the  importance  of  Ry-  is  controlled  by  Wy  and  the  reputation  score  of  u* 
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through  a  function  g. 


An  Optimization  Algorithm  for  RecSSN 

We  have  introduced  our  approaches  to  capture  local  and  global  information  from 
trust /distrust  networks.  With  these  model  components,  we  propose  a  recommenda¬ 
tion  framework,  RecSSN,  which  exploits  local  and  global  information  simultaneously 
from  trust /distrust  networks.  The  proposed  RecSSN  framework  solves  the  following 
optimization  problem: 

N  m 

minE  E9<W«-w‘)II(R«  -  U‘V7)ll2  +  afllUIIJ.  +  IIVIIP 

i=  1  3=1 
n 

+  max(°’  HU*  -  Uflll  -  ||Ui  -  urill)  (5.21) 

i— 1 

where  f3  Ya=i  max(°>  ||Uj  -  Uf|||  -  ||Uj  -  U"||!)  captures  local  information  from 
trust /distrust  networks  and  the  parameter  f3  controls  its  contribution.  The  term 
g(Wij,Wi )  is  introduced  to  capture  global  information  from  trust /distrust  networks. 

By  setting  g(Wy,  w/  =  Wy  and  ignoring  all  distrust  links,  the  proposed  formu¬ 
lation  for  RecSSN  in  Eq.  (5.21)  can  be  written  as  follows: 

N  m  n 

min  ^Wy||(Ry  -UiVjJIII  +  adlUIIJ  +  IIVIIW+^IIU,  -  Uflli  (5,22) 

i= 1  j= 1  i=  1 

Interestingly,  this  formulation  is  equivalent  to  one  of  the  state-of-the-art  recommender 
systems  with  trust  networks  SocialMF  [27].  Therefore,  RecSSN  provides  a  unified 
recommendation  framework  with  trust  and  trust /distrust  networks.  Eq.  (5.21)  is 
jointly  convex  with  respect  to  U  and  V  and  there  is  no  nice  solution  in  closed  form 
due  to  the  use  of  the  max  function.  A  local  minimum  can  be  obtained  through  the 
gradient  decent  optimization  method  in  Appendix  D,  which  usually  works  well  for 
recommender  systems  [35].  After  learning  the  user  preference  matrix  U  and  the  item 
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characteristic  matrix  V,  an  unknown  score  R.;y  from  the  user  u\  to  the  item  w'  will 
be  predicted  as  R jy  =  u Jvy. 

5.2.3  Evaluation 

In  this  section,  we  conduct  experiments  to  answer  the  following  two  questions  - 
(1)  can  the  proposed  RecSSN  framework  improve  the  recommendation  performance 
by  exploiting  trust /distrust  networks?  and  (2)  which  model  components  of  RecSSN 
contribute  to  the  performance  improvement?  Before  answering  these  questions,  we 
begin  by  introducing  the  experimental  settings. 

Experimental  Settings 

Following  common  ways  to  assess  recommendation  performance  in  rating  systems,  we 
choose  two  metrics,  corresponding  to  the  Root  Mean  Square  Error  (RMSE)  and  the 
Mean  Absolute  Error  (MAE),  which  are  formally  defined  as  follows: 

It.,...  .)<=-r  (Ri>  ~  r»)2 

RMSE  =  y  {  ^  |T|  - — , 

MAE  =  — —  |Rjj  —  Rql,  (5.23) 

(ai,wj)eT 

where  T  is  the  set  of  ratings  in  the  testing  set,  \T\  is  the  size  of  T  and  Ry  is  the 
predicted  rating  from  ui  to  vr  A  smaller  RMSE  or  MAE  value  means  better  perfor¬ 
mance.  Note  that  previous  work  demonstrated  that  small  improvement  in  RMSE  or 
MAE  terms  can  have  a  significant  impact  on  the  quality  of  the  top  few  recommenda¬ 
tions  [33].  In  this  work,  we  choose  x%  of  rating  scores  as  training  and  the  remaining 
1  —  x%  as  testing,  and  x  is  varied  as  {50,  70,  90}. 
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Performance  Comparison  of  Recommender  Systems 


To  answer  the  first  question,  we  compare  the  proposed  RecSSN  framework  with  exist¬ 
ing  recommender  systems.  Traditional  collaborative  filtering  systems  can  be  grouped 
into  memory-based  systems  and  model-based  systems;  hence  we  choose  two  groups 
of  baseline  methods. 

The  first  group  of  baseline  methods  includes  the  following  memory-based  systems: 

•  UCF:  This  system  makes  recommendations  by  aggregating  recommendations 
from  ones’  similar  users  only  based  on  the  user-item  matrix. 

•  pUCF :  This  system  is  a  variant  of  UCF.  which  combines  recommendations 
from  ones’  similar  users  and  their  trust  friends  [52],  pUCF  utilizes  both  user- 
item  matrix  and  trust  links. 

•  pnUCF :  This  system  is  a  variant  of  pUCF.  which  excludes  recommendations 
from  ones’  foes  by  exploiting  distrust  links  [95].  pnUCF  makes  use  of  user- item 
matrix,  trust  and  distrust  links. 

The  second  group  of  baseline  methods  includes  the  following  model-based  systems: 

•  MF:  This  system  performs  matrix  factorization  on  the  user-item  matrix  as 
shown  in  Eq.  (5.14)  [64].  It  only  utilizes  the  user-item  matrix. 

•  SocialMF:  This  system  combines  both  user-item  matrix  and  trust  links  for 
recommendation  [27],  which  is  a  special  case  of  the  proposed  framework  with 
only  trust  links  as  shown  in  Eq.  (5.22). 

•  SoReg:  This  system  also  leverages  both  user-item  matrix  and  trust  links,  and 
defines  social  regularization  to  capture  trust  links  [47]. 
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•  LOCABAL:  This  system  captures  local  and  global  information  of  trust  links 
under  the  matrix  factorization  framework  [81]. 

•  disSoReg:  In  [45],  two  systems  are  proposed  to  exploit  trust  and  distrust 
links,  respectively.  disSoReg  is  a  combination  of  these  two  systems  to  exploit 
trust  and  distrust  links  simultaneously,  which  is  actually  a  variant  of  SoReg 
by  considering  distrust  links  as  dissimilarity  measurements. 

Note  that  we  use  cross-validation  to  determine  parameters  for  all  baseline  methods. 
For  RecSSN,  /3  is  set  to  0.7.  We  empirically  set  a  =  0.1  and  the  number  of  latent 
factors  K  =  10  for  both  datasets.  In  Eq.  (5.20),  we  empirically  find  that  f(x)  = 
/ot/(r+i)  and  9{xiV)  =  x  *  y  work  well.  The  comparison  results  are  demonstrated  in 
Tables  5.4. 

We  make  the  following  observations: 

•  In  general,  model-based  methods  outperform  memory-based  methods  on  the 
two  studied  datasets.  Most  of  the  existing  recommender  systems  suffer  from 
the  data  sparsity  problem  but  model-based  methods  are  usually  less  sensitive 
than  memory-based  methods  [33]. 

•  pUCF  outperforms  UCF.  Furthermore,  SocialMF,  SoReg  and  LOCABAL 
outperform  MF.  These  results  support  the  known  contention  that  exploiting 
trust  links  can  significantly  improve  recommendation  performance. 

•  LOCABAL  exploits  local  and  global  information  from  trust  links,  and  obtains 
better  performance  than  the  systems  which  model  only  local  information  from 
trust  links  such  as  SocialMF  and  SoReg.  These  observations  indicate  the 
importance  of  global  information  for  recommendation. 
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•  pnUCF  obtains  better  performance  than  pUCF.  which  suggests  that  excluding 
recommendations  from  users  with  distrust  links  can  improve  recommendation 
performance.  Furthermore,  disSoReg  performs  worse  than  SoReg.  These 
results  suggest  that  we  may  not  consider  distrust  links  as  dissimilarities  in 
recommendation,  which  is  consistent  with  observations  in  [83]. 

•  The  proposed  RecSSN  framework  always  obtains  the  best  performance.  RecSSN 
captures  local  and  global  information  from  trust /distrust  networks.  In  addition 
to  trust  links,  trust /distrust  networks  also  provide  distrust  links.  More  de¬ 
tails  about  the  effects  of  distrust  links  on  the  performance  of  RecSSN  will  be 
discussed  in  the  following  subsection. 

With  these  observations,  we  can  draw  conclusions  about  the  first  question  -  the 
proposed  RecSSN  framework  outperforms  the  state-of-the-art  recommender  systems 
by  exploiting  local  and  global  information  from  trust /distrust  networks. 
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Impact  of  Distrust  Links  on  RecSSN 


We  will  now  focus  on  the  second  issue  of  examining  the  precise  impact  of  distrust 
links  on  RecSSN.  The  experimental  results  in  the  previous  subsection  show  that 
the  proposed  RecSSN  framework  outperforms  various  representative  recommender 
systems  with  trust  networks.  Compared  to  these  systems,  RecSSN  also  leverages 
information  from  distrust  links.  In  this  subsection,  we  investigate  the  impact  of 
distrust  links  on  the  proposed  RecSSN  framework  to  answer  the  second  question.  In 
particular,  we  eliminate  the  effects  of  distrust  links  systematically  from  RecSSN  by 
defining  the  following  algorithmic  variants: 

•  RecSSN\  GN  -  Eliminating  the  effect  of  distrust  links  from  global  information 
of  trust /distrust  networks  by  using  Pagerank  to  calculate  status  scores  of  users 
with  only  trust  links. 

•  RecSSN\LN  -  Eliminating  the  effect  of  distrust  links  from  local  information  of 
trust /distrust  networks  by  replacing  ^”=1max(0,  ||Uj  —  U|>|||  —  ||Uj  —  U”|||) 
with  Yh=i  l|Ui  -  Uf|||  in  Eq.  (5.21). 

•  RecSSN\  GN-LN  -  Eliminating  the  effects  of  distrust  links  from  global  and  local 
information  of  trust /distrust  networks. 

The  parameters  in  all  these  variants  are  determined  via  cross-validation.  The  ex¬ 
perimental  results  are  demonstrated  in  Figure  5.4.  In  general,  eliminating  any  model 
component  which  captures  the  effect  of  distrust  links  will  reduce  the  recommendation 
performance.  The  relative  performance  reductions  for  variants  compared  to  RecSSN 
are  shown  in  Table  5.5.  When  eliminating  the  effect  of  global  information  of  distrust 
links  from  the  proposed  framework,  the  performance  of  RecSSN\GN  degrades.  We 
make  a  similar  observation  for  RecSSN\LN  when  eliminating  the  effect  of  local  in- 
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(a)  AME  (b)  RMSE 


Figure  5.4:  Impact  of  Distrust  Links  on  The  Proposed  Framework  RecSSN. 


Variants 

50% 

70% 

90% 

MAE 

RMSE 

MAE 

RMSE 

MAE 

RMSE 

RecSSN\  GN 

-0.88% 

-1.02% 

-0.98% 

-1.21% 

-0.92% 

-1.15% 

RecSSN\LN 

-2.06% 

-3.06% 

-3.15% 

-2.71% 

-1.67% 

-3.21% 

RecSSN\  GN-LN 

-2.59% 

-3.29% 

-3.56% 

-3.22% 

-2.04% 

-3.56% 

Table  5.5:  Relative  Performance  Reductions  for  Variants  Compared  to  RecSSN. 


formation.  For  example,  compared  to  RecSSN,  RecSSN\ GN  and  RecSSN\LN  have 
1.02%  and  3.06%  relative  performance  reductions,  respectively,  in  terms  of  RMSE 
with  50%  of  Epinions  data.  When  eliminating  the  effects  of  distrust  links  from  global 
and  local  information  of  trust /distrust  networks,  RecSSN\  GN-LN  obtains  worse  per¬ 
formance  than  both  RecSSN\  GN  and  RecSSN\LN.  This  suggests  that  local  and  global 
information  contain  complementary  information  to  each  other  for  recommendation. 

With  the  results  from  Figure  5.4  and  Table  5.5,  we  can  answer  the  second  ques¬ 
tion  -  both  local  and  global  information  of  distrust  links  in  the  proposed  RecSSN 
framework  can  help  improve  the  recommendation  performance. 
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5.3  Conclusion 


In  this  chapter,  we  study  two  applications  of  distrust  -  node  classification  in 
trust /distrust  networks  and  recommendation  with  trust /distrust  networks.  The  re¬ 
search  results  suggest  that  (1)  negative  links  may  not  denote  dissimilarities;  (2)  the 
observation  that  users  should  site  closer  to  their  “friends”  than  their  “foes”  paves  a 
way  to  capture  trust /distrust  relations;  and  (3)  distrust  have  added  value  over  trust 
and  can  significantly  improve  the  performance  of  node  classification  and  recommen¬ 
dation. 
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Chapter  6 


GENERALIZING  FINDINGS  OF  DISTRUST 


Trust  is  a  special  type  of  positive  links,  and  many  properties  and  algorithms  of  trust 
can  be  generalized  to  positive  links  such  as  friendships  and  like  [87].  For  example, 
similar  properties  such  as  transitivity  and  homophily  have  been  observed  for  positive 
links  [99,  77];  trust  prediction  algorithms  perform  well  with  positive  links  [57,  77]; 
and  many  trust  application  frameworks  can  be  directly  applied  to  positive  links  [82, 
46].  Since  distrust  is  a  special  type  of  negative  links,  a  natural  question  here  is 
whether  we  can  generalize  some  properties  and  algorithms  of  distrust  to  negative 
links  such  as  foes  and  dislike.  This  investigation  can  greatly  expand  the  boundaries 
of  distrust  computing  and  make  research  achievements  be  applicable  to  a  wide  range 
of  applications. 

We  collect  a  dataset  from  Slashclot.  Slashdot  is  a  technology  news  platform  in 
which  users  can  create  friend  (positive)  and  foe  (negative)  links  to  other  users.  They 
can  also  post  news  articles.  Other  users  may  annotate  these  articles  with  their  com¬ 
ments  and  opinions.  Slashdot  users  can  associate  themselves  with  tags  and  join  some 
interest  groups.  Some  key  statistics  are  demonstrated  in  Table  6.7.  From  the  table, 
we  note  that  Slashdot  provides  sufficient  information  to  enable  this  investigation. 

In  the  following  sections,  we  will  investigate  whether  we  can  generalize  (1)  prop¬ 
erties,  (2)  distrust  prediction  algorithms  and  (3)  application  frameworks  of  distrust 
to  negative  links. 
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#  of  Users 

7,275 

#  of  Positive  Links 

67,705 

#  of  Negative  Links 

20,851 

#  of  Posts 

300,932 

#  of  Positive  Opinions 

1,742,763 

#  of  Negative  Opinions 

42,260 

#  of  Tags 

27,942 

#  of  Labels 

10 

Table  6.1:  Statistics  of  the  Slashdot  Dataset. 

6.1  Negative  Link  Properties 

Distrust  is  not  transitive,  highly  asymmetric  and  denotes  neither  similarity  nor 
dissimilarity.  In  this  section,  we  examine  these  properties  of  negative  links.  Note  that 
since  we  use  similar  methods  of  distrust  to  investigate  negative  links,  we  omit  details 
and  directly  present  results  and  observations. 

6.1.1  Transitivity 

The  results  of  transitivity  of  negative  links  are  shown  in  Table  6.2.  We  make 
similar  observations  as  -  (1)  positive  links  are  transitive;  (2)  negative  links  are  not 
transitive;  and  (3)  when  (u.rUj ,Uj-Uk) ,  it  is  also  likely  that  /ul+Uk1  which  can  be 
explained  by  balance  theory  as  “enemies’ enemies  are  friends” . 

6.1.2  Asymmetry 

We  show  the  results  of  asymmetry  of  negative  links  in  Table  6.3.  Positive  links 
are  asymmetric;  while  negative  links  are  highly  asymmetric.  These  observations  are 
consistent  with  those  for  distrust. 
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Trust 


Types 

Number 

PI 

P2 

4,197,533 

82.03% 

N.A. 

(Ui+UjjUj+Uk),  Ui+Uk 

898,905 

17.57% 

97.89% 

Uj Uk)  5  tli-ZLfc 

19,365 

0.4% 

2.11% 

Distrust 

Types 

Number 

PI 

P2 

777,586 

91.70% 

N.A. 

13,362 

1.67% 

54.07% 

(Uj-Uj  JlLj-Uh),  Ui-Uk 

11,351 

1.41% 

45.93% 

Table  6.2:  Transitivity  of  Positive  and 

1  Negative  Links  in  Slashdot. 

Uj+Ui(% ) 

Uj-Ui(% ) 

Uj?Ui(%) 

ZLi~\~ZLj 

59,965(31.62) 

556(0.29) 

129,121(68.09) 

UfUj 

556(1.69) 

2,055(6.26) 

30,218(92.05) 

Table  6.3:  Asymmetry  of  Positive  and 

6.1.3 

.  Negative  Links. 

Similarity 

We  investigate  similarities  between  pairs  with  positive  links,  negative  links  and 
no  links,  and  the  average  similarities  are  illustrated  in  Table  6.4.  We  observe  -  (1) 
pairs  with  positive  links  are  likely  to  be  similar;  (2)  pairs  with  negative  links  are  more 
similar  than  those  randomly  selected  pairs;  and  (3)  pairs  with  positive  links  are  more 
similar  than  those  with  negative  links.  Similar  observations  are  made  for  distrust  and 
trust  relations. 
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Cl 

COSINE 

CI-COSINE 

Foe  (sd) 

0.1926 

0.1926 

0.5577 

Friend  (st) 

0.2160 

0.2160 

0.6480 

Random  Pairs  (s.r) 

0.1759 

0.1759 

0.5385 

Table  6.4:  Similarity  for  Positive  and  Negative  Links. 

6.2  Negative  Link  Prediction 

We  make  a  number  of  observations  of  distrust  in  Chapter  4  and  based  on  these 
observations,  we  propose  an  unsupervised  framework  dTrust  and  a  supervised  frame¬ 
work  NeLP  to  predict  distrust  by  leveraging  two  sources  -  trust  and  content-centric 
user  interactions.  In  this  section,  we  first  investigate  whether  we  can  make  similar 
observations  for  negative  links  and  then  expand  dTrust  and  NeLP  to  predict  negative 
links  by  using  positive  links  and  content-centric  user  interactions. 

6.2.1  Analysis  on  Negative  Links 

We  compute  the  lengths  of  shortest  paths  of  pairs  with  negative  links  in  the 
positive  networks  and  the  length  distribution  is  demonstrated  in  Figure  6.1.  We 
make  similar  observations  -  more  than  50%  of  our  foes  are  within  2  hops  and  more 
than  80%  of  our  foes  are  within  3  hops. 

We  examine  all  triads  in  Slashdot  and  find  that  (1)  93.01%  of  triads  are  balanced; 
and  (2)  93.38%  of  triads  satisfy  status  theory. 

We  investigate  the  existence  of  correlation  between  negative  links  and  negative 
interactions  via  a  two  sample  t-test.  Evidence  from  t-test  suggests  that  there  is  a 
strong  correlation  between  negative  links  and  negative  interactions,  and  users  with 
negative  interactions  are  likely  to  have  negative  links.  The  distributions  of  ratios 
of  negative  links  with  respect  to  the  number  of  negative  interactions  are  shown  in 
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Figure  6.1:  Ratio  Distributions  of  the  Length  of  Shortest  Path  for  Pairs  with  Neg¬ 
ative  Links  in  the  Positive  Networks. 
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Figure  6.2:  The  Ratios  of  Negative  Links  with  respect  to  the  Number  of  Negative 
Interactions. 

Figure  6.2.  The  ratio  of  a  randomly  selected  pair  as  a  negative  link  is  3.9402e  —  04 
in  Slashdot.  Even  when  the  numbers  of  negative  interactions  are  small,  the  ratios 
are  much  higher  than  the  random  one,  which  further  supports  that  existence  of  the 
correlation.  Furthermore  with  increase  of  the  number  of  negative  interactions,  the 
ratios  tend  to  increase.  Therefore,  an  increase  in  the  number  of  negative  interactions 
increases  the  likelihood  of  negative  links  between  users. 

Aforementioned  analysis  on  negative  links  suggests  that  observations  of  distrust 
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Slashdot50 

Slashdot70 

SlashdotlOO 

neglnter 

0.0323 

0.0819 

0.0997 

random 

0.00001 

0.00002 

0.00003 

dTrust 

0.0754 

0.1372 

0.1491 

— -neglnter 

- random 

dirust 

Slashdot50  Slashdot70  SlashdotlOO 

Figure  6.3:  Performance  of  dTrust  in  Predicting  Negative  Links  in  Slashdot. 


can  be  generalized  to  negative  links,  which  indicates  that  the  proposed  frameworks 
dTrust  and  NeLP  for  distrust  prediction  may  be  generalized  to  predict  negative  links. 

6.2.2  dTrust  for  Negative  Link  Prediction 

Following  similar  experimental  settings  to  assess  dTrust  in  Epinions,  we  evaluate 
dTrust  on  Slashdot  and  the  performance  is  illustrated  in  Figure  6.3.  neglnter  obtains 
much  better  performance  than  random,  which  suggests  the  existence  of  the  correlation 
between  negative  links  and  negative  interactions.  Performance  comparison  between 
dTrust  and  random  suggests  that  dTrust  can  accurately  predict  negative  links,  which 
indicates  that  dTrust  can  be  generalized  to  negative  link  prediction. 

6.2.3  NeLP  for  Negative  Link  Prediction 

We  generalize  NeLP  to  predict  negative  links  by  using  positive  links  and  content¬ 
centric  user  interactions  in  Slashdot  and  the  performance  is  shown  in  Table  6.5. 
neglnS  always  outperforms  negln,  which  indicates  that  status  theory  is  helpful  in 


Algorithms 

Slashdot 

FI 

Precision 

random 

0.0008 

0.0004 

sPath 

0.0090 

0.0172 

negln 

0.1986 

0.1483 

neglnS 

0.2072 

0.1524 

NeLP-negln 

0.2394 

0.2083 

NeLP 

0.2441 

0.2139 

Table  6.5:  Performance  of  NeLP  in  Predicting  Negative  Links  in  Slashdot. 

negative  link  prediction.  Compared  to  random,  NeLP  obtains  much  better  perfor¬ 
mance,  which  suggests  that  NeLP  can  be  generalized  to  negative  link  prediction. 

Because  the  classifier  learned  by  NeLP  is  based  on  the  same  set  of  features  ex¬ 
tracted  from  pervasively  available  sources  for  most  social  media  sites,  it  is  possible  to 
generalize  the  classifier  learned  in  one  site  to  other  sites  and  we  further  investigate 
how  well  the  learned  classifier  generalizes  across  social  media  sites.  In  particular,  we 
evaluate  the  performance  of  the  classifier  on  Epinions  (or  Slashdot),  which  is  learned 
from  Slashdot  (or  Epinions).  The  results  are  shown  in  Figure  6.4.  Note  that  in  the 
figure  x  — y  y  denotes  training  on  x  and  evaluating  on  y.  These  results  show  that 
there  is  very  good  generalization  of  the  classifier  learned  by  NeLP  although  there 
is  remarkably  little  decrease  in  performance  regardless  of  which  dataset  is  used  for 
training. 


6.3  Negative  Link  Applications 

In  Chapter  5,  we  propose  a  node  classification  framework  NCSSN  to  infer  labels  of 
unlabeled  nodes  by  leveraging  labeled  nodes  and  trust /distrust  networks,  and  a  rec- 


(a)  Epinions  (b)  Slashdot 


Figure  6.4:  The  Negative  Link  Prediction  Performance  across  Epinions  and  Slash- 
dot. 


ommendation  framework  RecSSN,  which  exploits  trust /distrust  networks  to  improve 
recommendation  performance.  In  this  section,  we  investigate  the  generalization  of 
NCSSN  and  RecSSN  from  trust /distrust  networks  to  signed  social  networks  (or  net¬ 
works  with  positive  and  negative  links). 

6.3.1  NCSSN  for  Node  Classification  in  Signed  Social  Networks 

Similar  to  evaluation  of  NCSSN  in  Epinions,  we  assess  NCSSN  in  Slashdot.  Users 
in  Slashdot  can  join  in  some  interest  groups  and  these  group  identifiers  are  treated 
as  the  class  labels.  Via  cross  validation,  the  parameters  for  NCSSN  are  set  as  {a  = 
l,/3  =  0.5,  K  =  500}  and  the  experimental  results  are  demonstrated  in  Table  6.6. 
We  observe  that  -  (1)  sICAS  and  sICAd  outperform  ICA  and  negative  links  can 
improve  node  classification  performance;  (2)  disGReg  performs  worse  than  GReg, 
and  negative  links  may  not  denote  dissimilarities;  and  (3)  performance  comparison 
between  NCSSN  and  Random .  indicates  that  NCSSN  can  significantly  improve  node 
classification  performance  with  signed  social  networks.  All  these  observations  suggest 
that  NCSSN  can  be  generalized  to  the  problem  of  node  classification  in  signed  social 
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Slashdot 

Algorithms 

5% 

10% 

15% 

20% 

ICA 

19.99 

20.86 

21.25 

21.34 

sIC A3 

21.46 

21.68 

22.31 

22.75 

sICA4 

22.24 

22.30 

22.85 

23.13 

GReg 

19.57 

20.91 

21.23 

22.03 

disGReg 

18.93 

19.85 

20.54 

21.19 

sGReg 

20.56 

22.56 

22.77 

22.90 

NCSSN 

23.54 

24.66 

25.20 

25.62 

Random 

8.33 

8.31 

8.34 

8.34 

Table  6.6:  Performance  Comparison  of  Node  Classification  in  Slashdot. 


networks. 


6.3.2  RecSSN  for  Recommendation  with  Signed  Social  Networks 


In  Slashdot,  users  are  associated  with  certain  tags  and  the  recommendation  task 
is  to  recommend  tags  to  users.  In  this  scenario,  the  performance  is  often  evaluated 
via  precision@N  and  recall@N  [67],  which  are  formally  defined  as  follows: 


precision@N  = 


Y.^u\T«pNiOh 


recall@N  = 


\TopN, 
S-.aJropVrVil 
Yhm&A  l-^l 


where  TopNi  is  the  set  of  N  items  recommended  to  user  Ui  that  has  not  been 
associated  in  the  training  set,  and  R  is  the  set  of  items  that  have  been  associated 
with  Ui  in  the  testing  set.  A  larger  precision@N  or  recall@N  value  means  better 
performance.  The  values  of  precision@N  and  recall@N  are  usually  small  in  the  case 
of  sparse  datasets.  For  example,  the  precision@5  is  less  than  0.05  over  a  dataset  with 
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Metrics 

Memory-based  Methods 

Model-based  Methods 

UCF 

pUCF 

pnUCF 

MF 

SocialMF 

SoReg 

LOCABAL 

disSoReg 

RecSSN 

P@5 

0.0343 

0.0372 

0.0381 

0.0354 

0.0387 

0.0386 

0.0394 

0.0379 

0.0419 

R@5 

0.0438 

0.0479 

0.0485 

0.0453 

0.0492 

0.0488 

0.0498 

0.0473 

0.0511 

P@10 

0.0332 

0.0358 

0.0364 

0.0338 

0.0365 

0.0368 

0.0375 

0.0359 

0.0388 

R@10 

0.0413 

0.0454 

0.0463 

0.0427 

0.0463 

0.0467 

0.0479 

0.0457 

0.0497 

Table  6.7:  Comparison  of  Different  Recommender  Systems  in  Slashdot. 


8.02e  —  3  density  [17,  18].  In  this  work,  we  set  IV  =  5  and  N  =  10. 

The  recommendation  performance  is  illustrated  in  Figure  6.7.  Note  that  we  set 
f3  =  0.3  via  cross  validation.  It  is  observed  that  (1)  disSoReg  obtains  worse  per¬ 
formance  than  SoReg  and  we  may  not  consider  negative  links  as  dissimilarities  in 
recommendation;  and  (2)  RecSSN  can  significantly  improve  recommendation  per¬ 
formance  by  exploiting  signed  social  networks  compared  to  baseline  methods.  These 
observations  suggest  that  the  recommendation  framework  RecSSN  can  be  generalized 
from  trust/distrust  networks  to  signed  social  networks. 

6.4  Conclusion 

Some  properties  and  algorithms  of  trust  can  be  generalized  to  positive  links,  which 
motivates  us  to  study  whether  we  can  generalize  findings  of  distrust  to  negative  links. 
The  aforementioned  investigations  suggest  that  (1)  similar  properties  are  observed 
for  negative  links  as  distrust;  (2)  distrust  prediction  frameworks  dTrust  and  NeLP 
can  accurately  predict  negative  links;  and  (3)  application  frameworks  NCSSN  and 
RecSSN  can  be  generalized  from  trust /distrust  networks  to  signed  social  networks 
(or  networks  with  positive  and  negative  links). 
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Chapter  7 


CONCLUSION  AND  FUTURE  WORK 


In  this  chapter,  we  summarize  our  research  results  and  their  broader  impacts,  and 
discuss  promising  research  directions. 

7.1  Summary 

In  this  dissertation,  we  propose  four  innovative  research  tasks  -  (1)  understanding 
distrust;  (2)  predicting  distrust;  (3)  applying  distrust  and  (4)  generalizing  findings  of 
distrust. 

For  understanding  distrust,  we  investigate  properties  of  distrust  and  find  that  we 
can  not  extend  properties  of  trust  to  distrust  and  distrust  presents  distinct  properties. 
The  computational  task  of  predicting  distrust  from  only  trust  suggests  that  we  can 
not  predict  distrust  from  only  trust  hence  distrust  is  not  the  negation  of  trust  in 
social  media;  while  the  computational  task  of  predicting  trust  with  information  from 
distrust  indicates  that  distrust  can  significantly  improve  trust  prediction  performance 
hence  distrust  has  added  value  over  trust. 

For  predicting  distrust,  we  formally  define  the  problem  and  make  a  number  of 
important  hirelings  about  distrust  -  (1)  our  “foes”  are  close  to  us  in  the  trust  network; 
(2)  most  triads  satisfy  balance  and  status  theories;  (3)  there  is  a  strong  correlation 
between  distrust  and  negative  interactions;  and  (4)  negative  interactions  between 
users  increase  the  propensity  of  distrust.  These  findings  serve  as  the  groundwork 
of  an  unsupervised  framework  dTrust  and  a  supervised  framework  NeLP,  which  can 
predict  distrust  accurately  by  leveraging  trust  and  content-centric  user  interactions. 

For  applying  distrust,  we  propose  principled  approaches  to  model  distrust  in  two 


93 


representative  social  media  applications,  i.e. ,  node  classification  and  recommendation. 
The  successful  experiences  of  applying  distrust  in  node  classification  and  recommen¬ 
dation  suggest  that  (1)  distrust  may  not  denote  dissimilarity  although  trust  denotes 
similarity;  and  (2)  distrust  has  potentials  in  improving  the  performance  of  social 
media  applications. 

For  generalizing  findings  of  distrust,  we  investigate  the  generalization  of  proper¬ 
ties  and  algorithms  of  distrust  to  negative  links.  We  find  that  (1)  negative  links  show 
similar  properties  as  distrust;  (2)  distrust  predication  frameworks  dTrust  and  NeLP 
can  accurately  predict  negative  links;  and  (3)  the  node  classification  framework  NC- 
SSN  and  the  recommendation  framework  RecSSN  can  be  successfully  expanded  for 
negative  links. 

This  dissertation  investigates  original  problems  that  entreat  unconventional  data 
mining  solutions.  They  are  challenging  because  distrust  is  often  not  available  in  social 
media  and  they  are  original  because  little  is  known  about  distrust  and  its  role  in  social 
media  applications.  Methodologies  and  techniques  presented  in  this  dissertation  also 
have  broader  impacts: 

•  Data  availability  is  still  a  challenging  problem  for  social  scientists  [98].  Social 
media  provides  a  virtual  world  for  users  online  activities  and  makes  it  possible 
for  social  scientists  to  observe  social  behavior  and  interaction  data  of  hundreds 
of  millions  of  users.  Our  successful  experiences  of  using  social  media  data  to 
study  the  social  concept  distrust  pave  the  way  for  new  research  endeavors  to 
enable  the  large-scale  study  of  user  behaviors  in  social  media  in  computational 
social  science. 

•  The  enabling  of  distrust  in  social  media  and  the  successfully  applying  distrust  in 
social  media  applications  not  only  can  have  impact  on  industrial  IT  applications 
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by  improving  services  and  user  experience  but  also  will  open  doors  to  new 
opportunities  of  research  and  development  involving  social  media. 

•  Social  theories  are  useful  to  explain  user  behaviors  in  social  media  and  play  im¬ 
portant  roles  in  helping  distrust  computing  in  social  media.  The  techniques 
of  modeling  social  theories  may  be  directly  applied  to  social  media  mining 
tasks  [102]  and  their  success  manifests  a  new  research  direction  -  mining  so¬ 
cial  media  data  with  social  theories. 

7.2  Future  Work 

Computing  distrust  in  social  media  is  still  in  its  early  stages  of  development  and 
an  active  area  of  exploration.  Below  we  present  some  promising  research  directions: 

•  Distrust  Prediction  with  Cross-media  Data:  Our  previous  study  suggests 
that  the  learned  distrust  predictor  by  NeLP  have  very  good  generalization  across 
social  media  sites,  which  suggests  not  only  that  some  underlying  general  prin¬ 
ciples  guide  the  creation  of  distrust  relations  but  also  that  cross-media  data 
has  potentials  in  distrust  prediction.  The  key  issue  of  the  problem  of  distrust 
prediction  with  cross-media  data  is  how  to  transfer  knowledge  or  patterns  from 
distrust  in  the  source  site  to  the  target  site.  This  would  be  an  application 
scenario  of  transfer  learning  [62];  hence,  we  will  investigate  under  what  circum¬ 
stances,  transfer  learning  algorithms  are  applicable  for  distrust  prediction  with 
cross-media  data. 

•  Evaluation  without  Ground  Truth:  Evaluations  in  this  dissertation  are 
based  on  datasets  with  ground  truth.  However  distrust  is  usually  unavailable  in 
social  media  and  then  the  chief  challenge  of  evaluation  is  related  to  the  invisi¬ 
bility  of  distrust  -  how  to  verify  the  correctness  of  predicted  distrust  when  they 
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are  invisible.  We  can  take  a  multipronged  approach  to  evaluation  challenges. 
Traditionally,  training  and  test  datasets  are  used  in  evaluation;  in  such  as  a 
case,  we  say  there  is  ground  truth.  When  these  datasets  are  not  available,  user 
studies  are  conducted  (e.g.,  employing  Amazon  Mechanical  Turks).  We  also 
can  try  user  studies  as  in  a  study  [100]  to  verify  the  performance  of  predicting 
strong  and  weak  ties  on  Facebook.  In  essence,  this  method  would  rely  on  a 
group  of  recruited  subjects  who  donate  their  data  but  withhold  distrust  and 
then  compare  predicted  distrust  against  their  withheld  distrust. 

•  Putting  Distrust  into  More  Social  Media  Applications:  We  use  node 
classification  and  recommendation  as  examples  to  illustrate  that  distrust  can 
significantly  improve  their  performance.  The  exciting  progress  not  only  proves 
the  importance  of  distrust  but  also  suggests  that  we  should  put  distrust  into 
more  social  media  applications.  The  enabling  and  generalizing  of  distrust  fur¬ 
ther  broaden  its  applications.  We  will  investigate  how  to  apply  distrust  in  more 
social  media  applications  such  as  data  clustering,  active  learning,  information 
propagation,  sentiment  analysis  and  feature  selection  [74], 
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APPENDIX  A 


AN  OPTIMIZATION  ALGORITHM  FOR  DISTRUST 
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Set  A  =  F  —  ArrT  and  let  C  contain  terms  related  to  U  and  H  in  the  objective 
function  J  of  Eq.  (4.9),  which  can  be  rewritten  as, 

C  =  TV(— 2(W  ©  W  0  A)UHtUt  +  (W  0  W  0  UHUT) 

UHTUT)  +  a(||U|||  +  ||V|||)  (A.l) 

the  partial  derivations  of  U  and  H  with  respective  to  J  can  be  obtained  from  C  are 

1  d  J  _  1  dC 

2«9U  ~~  2dU  ~~ 

-  (W  0  W  ©  A)UHt  -  (W  ©  W  0  A)tUH  +  q;U 

+  (W  0  W  ©  UHUt)UHt  +  (W  0  W  0  UHUt)tUH. 

1  dj  _  1  dC 

2«9H  ““  29H  ““ 

-  UT(W  0  W  0  A)U  +  UT(W  ©  W  ©  UHUt)U  +  O'H  (A.2) 

Set  B  =  G  —  UHUt  and  let  Cr  contain  terms  related  to  r  in  J ,  which  can  be 
rewritten  as, 

Cr  =  Tr(- 2A(W  0  W  ©  B)rrT 

+  A2(W  ©  W  0  rrT)rrT)  +  o;||r |||  (A. 3) 

then  the  partial  derivation  of  r  with  respect  to  J  is 

1  dj  _  1  d  Cr 

2  dr  2  dr 

-  A(W  ©  W  ©  B)r  -  A(W  0  W  ©  B)Tr  +  ar 

+  A2(W  0  W  0  rrT)r  +  A2(W  ©  W  0  rrT)Tr  (A.4) 

With  the  partial  derivations  of  U,  H.  and  r,  a  optimal  solution  of  the  objective 
function  in  Eq.  (4.9)  can  be  obtained  through  a  gradient  decent  optimization  method 
as  shown  in  Algorithm  3. 

Next  we  briefly  review  Algorithm  3.  In  line  1,  we  construct  the  trust  and  pseudo 
distrust  relation  matrix  F  and  its  weight  matrix  W  from  user-user  trust  relations  T, 
user-review  authorship  relations  P.  and  user-review  helpfulness  ratings  R.  From  line 
3  to  line  8,  we  update  U,  H  and  r  until  convergence  where  7^  and  are  learning 
steps,  which  are  chosen  to  satisfy  Goldstein  Conditions  [60].  After  learning  the  user 
preference  matrix  U,  FI  and  r  via  Algorithm  3,  the  reconstructed  trust  and  distrust 
matrix  is  F  =  UHUT  +  ArrT.  Finally  we  predict  pairs  (ui,Uj)  whose  sign(Fij)  =  —  1 
as  a  distrust  relation  with  confidence  |Fq). 
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Algorithm  3  The  Proposed  Framework  d Trust. 

Input  :  User-user  trust  relations  T,  user-review  authorship  relations  P,  user-review 
helpfulness  ratings  R,  {d,  A}. 

Output  :  A  ranking  list  of  pairs  of  users. 

1:  Construct  W  and  F  from  T.  P.  and  R 
2:  Initialize  U,  H  and  r  randomly 
3:  while  Not  convergent  do 
4:  Calculate  and  ^ 

5:  Update  U  <—  U  —  7«§^j 

6:  Update  H  H  —  7/iffj 

7:  Update  r  r  —  7r§p 

8:  end  while 
9:  Set  F  =  UHUt  +  ArrT 
10:  Set  V  =  {(ui,Uj)\sign(Fjj)  =  —1} 

11:  Ranking  pairs  of  users  in  V  (e.g .,(«*, Mj))  according  to  |F|  (e.g.,  F,:j — )  in  a 
descending  order 
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APPENDIX  B 


AN  OPTIMIZATION  ALGORITHM  FOR  THE  SUPERVISED  FRAMEWORK 
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We  solve  the  optimization  problem  in  Eq.  (4.14)  based  on  the  dual  form  [4].  The 
classical  representer  theorem  states  that  the  solution  to  this  minimization  problem  of 
Eq.(  4.14)  exists  in  Hk  and  can  be  written  as  follows: 


w*  =  y>if(x,,x) 


Eq.(  4.14)  can  be  rewritten  as  follows: 


^  ^  (j 

min  -aTKa  +  Cp  e,  +  Cn  y  Cj€j  +  — aTK£K 

a,b,e  2  J  '  V 


a 


Ui&PS 


UjGNS 


S.t.  Vi(^2  akK(Xk,  Xj)  +  6)  >  1  -  Wj  G  PS 

k 

Vj{ akK(xk,  Xj)  +  5)  >  1  -  Uj  G  IV  S 


6j  >  0,  >  0 

where  K  is  the  Gram  matrix  over  all  samples. 
We  define  s,  for  x,-  as  follows: 


J  Gp  for  Xj  G  PS, 
|  CnCi  for  Xj  G  iVS. 


(B.l) 


(B-2) 


(B.3) 


After  the  introduction  of  two  sets  of  multipliers  f3  and  7,  the  Lagrangian  function 
of  Eq.(  B.2)  is  as  follows: 


1 

L( w,  b ,  e,  a,  7)  =  -aT( K  +  C\,KCK)a  +  SjCj 

2=  1 

l  l 

~  VI  A[yi(y^afc-^(xfc,Xj)  +  6)  -  1  +  ej]  -  y,  7tea  (B.4) 

2=1  k  2=1 

where  (3  and  7  are  Lagrange  multipliers. 

To  obtain  the  dual  representation,  we  set 

dL  z 

y  A?/* = 0 
2=1 
dL 

—  =  0  =>  Si  -  (3i  -  nfi  =  0  =>  0  <  f3i  <  Si  (B.5) 

OEi 

With  Eq.  (B.5),  we  can  rewrite  the  Lagrangian  as  a  function  of  only  a  and  (3  as 
follows: 


1 

L(a,P)  =  -«t(K  +  GK£K)«  -  «TKJTY^  + 

2=1 


(B.6) 
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in  Eq.  (B.6),  J  =  [I  0]  where  I  is  an  l  x  l  identity  matrix  and  0  is  a  l  x  p  rectangular 
matrix  with  all  zeros,  and  Y  is  a  l  x  l  diagonal  matrix  composed  by  labels  of  samples 
in  PS  and  NS. 

By  setting  =  0,  we  obtain 

a={  I  +  ChKC)-1  JtY/3  (B.7) 

After  substituting  back  in  the  Lagrangian  function,  we  obtain  the  dual  problem 
as  a  quadratic  programming  problem: 


max 

P 


Eft 


s.t. 


i 

^2  PiVi  =  0 

i=  1 


o  <  A  <  St 


where  Q  is  defined  as  follows: 

Q  =  YJK(I  +  Cilifr'.rY 


(B.8) 


(B.9) 
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Let  C  be  the  Lagrangian  function  as: 

C  =  /( H4,  Hd,  U,  W)  -  Tr((Au)TU) 

-  Tr(( A/f*)THt)  -  Tr((AHd)THd)  (C.l) 

where  /( Hf,Hd,U,W)  is  the  objective  function  of  Eq.  (5.11).  The  notations  Au , 
AH f  and  AH  are  the  Lagrangian  multipliers  for  non- negativity  of  U,  H4  and  H'l 
To  compute  U7  we  fix  HL  HcZ  and  W.  The  derivative  of  C  with  respect  to  U  is 
as  follows: 


—  =  E  —  B  -  Au  (C.2) 

where  the  matrices  B  and  E  are  defined  as  follows: 

B  =  (T)tUH4  +  TU(H4)t  +  a((CYWT)+ 

+  (CUWWT)~)  +  /f((D)TUHd  +  DU(H(I)t) 

+  /3(  E  fiA  M^'U  +  U(Mijfe)T)) 

(i,j,k)es 

E  =  U(H4)TUTUH4  +  UH'UTU(Hf)T  +  a((CUWWT)  + 

+  (CYWt)“)  +  l5(U(Hl/)TUTUH‘i  +  UH,;UTU(Hd)) 

+d  E  fijk(MijkU  +  U(M*jfc)T)) +  +  AU  (C.3) 

(i,j,k)es 


where  for  any  matrix  X,  (X)+  and  (X)-  denote  the  positive  and  negative  parts  of  X, 
respectively. 

Setting  Jg  =  0  and  using  the  KKT  complementary  condition  UgAL  =  0,  we  can 
derive  the  update  rule  for  U  as  follows: 


U  ij  i  U  ij 


(C.4) 


To  compute  H4,  we  fix  U7  and  W.  The  derivative  of  C  with  respect  to  H4  is 
as  follows: 

clC 

—  =  UtUH'UtU  +  AH4  -  UtTU  -  AHt  (C.5) 

oH* 

Setting  Jgf  =  0  and  using  the  KKT  complementary  condition  HE  A  f*  =  0,  we 
can  get  the  update  rule  for  H4  as  follows: 


Hi,  4r-  H 


[UTTU 


v 


ij  i 


[UTUH4UTU  +  AH4 


(C.6) 
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Similarly,  we  can  obtain  the  update  rule  for  Hf/  as  follows: 


H 


UTDU]  —  ij 


[UTUH'4UTU  +  AHc/] 


(C.7) 


y 


Setting  =  0,  we  obtain  the  following: 


W=  (U'CU  +  AI)"1UtCY 


(C.8) 


With  update  rules  for  U,  H4,  Hrf  and  W,  the  detailed  algorithm  for  NCSNN  is 
shown  in  Algorithm  4.  Next,  we  give  a  brief  description  of  Algorithm  4.  In  line  1, 
we  construct  the  set  S  and  then  we  initialize  U,  H4.  EC  and  W  randomly.  From  line 
3  to  line  10,  we  update  U,  H4.  H,(  and  W  according  to  update  rules  in  Eqs.  (C.4), 
(C.6),  (C.7)  and  (C.8). 


Algorithm  4  The  Node  Classification  Framework  in  Trust/Distrust  Networks. 
Input:  T,  D,  Y  and  {a,  f3,  A} 

Output:  A  linear  classifier  W  and  the  user  latent  factor  matrix  U 
1:  Construct  S  as  S  =  {( i,j ,  k)\T.jj  =  1  A  =  1} 

Initialize  U,  H4,  H4  and  W  randomly 
while  Not  convergent  do 


2: 

3 

4: 

5: 

6 

7: 

8: 

9: 

10: 

11 

12 

13: 

14: 

15 

16 

17 

18: 

19: 

20: 


for  {i,j,  k)  G  S  do 

Construct  fl3k  and  M,jfc 

end  for 

Construct  B  and  E  as  Eq.  (C.3) 
for  i  from  1  to  A  do 
for  j  from  1  to  K  do 

U  ij  ^  • 

end  for 
end  for 

for  i  from  1  to  K  do 
for  j  from  1  to  K  do 


HA 


^  HA 


[UTTU], 


<-  I  I 


■trd 

end  for 
end  for 
W  =  (UTCU  - 
end  while 


[UTUHlUTU+AH<]y 
[UTDU]-ij 
[UT  UHdUT  U+AHd]i 


AI)_1UTCY 
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We  define  at  the  fc-th  iteration  for  rq  as  follows: 


M? 


lUi 


TJP||2  _  NTT.  _ 
uill2  llu« 

otherwise 


u? 


>  o 


(D.l) 


Then,  we  use  J  to  denote  the  objective  function  of  Eq.  (5.21)  in  the  fc-th  iteration 
as  follows: 


N  m 

3  =  £ £  j(W„,  wOlKRij  -  U4VT)||| 

i=l  j  1 

N  m.  N 

+  «(£  null!  +  £  IIWII!)  +  D  £  M‘( 

i= 1  j= 1  i= 1 

E  pp  si7u7-  E,  si7-u7- 

l|Uj  -  “jgPi  q  J||^  -  ||  Ui  -  VA  „  -||1 
The  derivatives  of  77  with  respect  to  Uj  and  Vj  are  as  follows: 


f)  <T 

,u  .  =  -2^//iW;/.W;)(I?,/  -  U,;Vj)V,  +  2o:Uj 

3 

+  2/3M?(Ui-  -  Uf)  -  2 0Mj(Ui  -  U”) 

-2/3  £  M*(U,-Up  1 

„7i£(  s3-i 

+  2/3  £  M*(U,  -  U")  1 

Uj&Afi  /-/Uj&Mi  ji 

8  7 

"  =  2^//(W,/,W;)(H;;  -  UiVTjUi  +  2nV, 


(D.2) 


(D.3) 


The  detailed  algorithm  is  shown  in  Algorithm  5.  In  Algorithm  5,  and  ryv  are 
learning  steps,  which  are  chosen  to  satisfy  Goldstein  Conditions  [60].  Next,  we  briefly 
discuss  the  algorithm.  In  line  1,  we  initialize  latent  factors  of  users  U  and  items  V 
randomly.  In  each  iteration,  we  calculate  Uf ,  U”  and  M*:  for  Ui  from  line  3  to  line  6. 
From  line  7  to  line  9,  we  update  U  and  V  using  aforementioned  update  rules. 
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Algorithm  5  The  Proposed  Recommendation  Framework  RecSSN  with  Signed  Social 

Networks. _ 

Input:  The  rating  information  R,  trust  links  T.  negative  links  D  the  number  of 
latent  factors  K  and  f3 

Output:  The  user  preference  matrix  U  and  the  item  characteristic  matrix  V 
1:  Initialize  U  and  V  randomly  and  set  k  —  1 
2:  while  Not  convergent  do 
3:  for  i  =  1  :  N_  do 

4:  Calculate  Uf  and  U”  according  to  Eq.  (5.18) 

5:  Calculate  Mf  according  to  Eq.  (D.l) 

6:  end  for 

7:  Calculate  and 

8:  Update  U  •(—  U  — 

9:  Update  V  t-  V  - 

10:  k  =  k  +  1 

11:  end  while 
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